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(57) Abstract: The present invention is directed to polypeptides that have enhanced alanine 2,3-aminomutase (AAM) activity and/or 
thermostability relative to the wild-type enzymes that have incidental AAM activity as a result of cross reactivity with alanine. In 
addition, the present invention is directed to a polynucleotides that encodes for the AAM polypeptides of the present invention, to 
nucleic acid sequences comprising the polynucleotides, to expression vectors comprising the polynucleotides operatively linked to 
a promoter, to host cells transformed to express the AAM polypeptides, and to a method for producing the AAM polypeptides of the 
present invention. 
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Attorney Docket No.0359.210WO/15686WO02 

IMPROVED ALANINE 2,3-AMINOMUTASES 
AND RELATED POLYNUCLEOTIDES 

FIELD OF THE INVENTION 
[01] The present invention is related to the field of enzymology, and particularly to 
the field of alanine 2,3-aminomutase (AAM) enzymology. More specifically, the 
present invention is directed to alanine 2,3-aminomutase polypeptides having 
improved enzymatic activity {i.e., high substrate turnover) and stability, and to 
polynucleotides sequences encoding for the improved alanine 2,3-aminomutase 
polypeptides. The present invention is useful because the alanine 2,3-aminomutase 
polypeptides can be coupled to other enzymes to produce synthetic organic chemicals, 
such as pantothenic acid or 3-hydroxypropionic acid in high yields. 

BACKGROUND OF THE INVENTION 
[02] Organic chemicals such as organic acids, esters, and polyols can be used to 
synthesize plastic materials and other products. To meet the increasing demand for 
organic chemicals, more efficient and cost-effective production methods are being 
developed which utilize raw materials based on carbohydrates rather than 
hydrocarbons. For example, certain bacteria have been used to produce large 
quantities of lactic acid used in the production of polylactic acid. 

[03] 3-hydroxypropionic acid (3 -HP) is an organic acid. Several chemical synthesis 
routes have been described to produce 3 -HP, and biocatalytic routes have also been 
disclosed (WO 01/16346 to Suthers et al.). 3-HP has utility for specialty synthesis 
and can be converted to commercially important intermediates by known methods in 
the chemical industry, e.g., acrylic acid by dehydration, malonic acid by oxidation, 
esters by esterification reactions with alcohols, and 1,3 -propanediol by reduction. 

[04] The compound 3-HP can be produced biocatalytically from PEP or pyruvate, 
through a key beta-alanine intermediate (FIG. 1). Beta-alanine can be synthesized in 
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cells from carnosine, beta-alanyl arginine, beta-alanyl lysine, uracil via 5,6- 
dihydrouracil and N-carbamoyl-beta-alanine, N-acetyl-beta-alanine, anserine, or 
aspartate. However, these routes are commercially unviable because they require rare 
precursors or starting compounds that are more valuable than 3 -HP. 
Therefore, production of 3 -HP using biocatalytic routes would be more efficient if 
alpha-alanine could be converted to beta-alanine directly (FIG. 1). Unfortunately, a 
naturally occurring enzyme that inter-converts alpha-alanine to beta-alanine has not 
yet been identified. It would be advantageous if enzymatic activities that carry out the 
conversion of alpha-alanine to beta-alanine were identified, such as an alanine 2,3- 
aminomutase. Accordingly, it is one object of the present invention to identify 
enzymes with improved alanine 2-3-aminomutase activity. 

[05] Lysine 2,3-aminomutase (KAM), which catalyzes the anaerobic 
interconversion of lysine to beta-lysine, was first described by Barker in Clostridium 
SB4 (now C. suhterminale) catalyzing the first step in the fermentation of lysine. 
KAM has been purified from C. suhterminale, the gene cloned and expressed in E. 
coli. See e.g., U.S. Pat. 6,248,874, which issued on June 19, 2001 to Frey et al, the 
whole of which is hereby incorporated herein by reference. The specific activity of 
purified KAM from C. suhterminale SB4 cells has been reported as 30-40 units/mg 
(Lieder et. al., Biochemistry 37:2578 (1998)), where a unit is defined as jtimoles 
lysine/min. The corresponding purified recombinantly produced KAM had equivalent 
enzyme activity (34.5 ± 1.6 |iimoles lysine/min/mg protein). See U.S. Patent 
Application Publication No. 2003/0113882 Al, which published on June 19, 2003 to 
Frey et al, the whole of which is incorporated herein by reference. 

[06] Based upon the sequence of the KAM from C. suhterminale, KAM genes have 
been annotated in the genomes of other organisms. However, in most cases, the 
enzymatic activities of the polypeptides encoded by these genes have not been 
confirmed. Exceptions are the B. subtilis gene (Chen, D., Ruzicka, FJ., and Frey, 
P.A. (2000) Biochem. J. 348:539-549)), and the Porphyromonas gingivalis and F. 
nucleatum genes. The B. subtilis KAM, encoded by the yodO gene, is more resistant 
to 0 2 than the C. suhterminale KAM, but it is markedly less active. As reported by 
Frey, the B. subtilis KAM has a specific activity of only 0.62 U/mg. 
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[07] C. subterminale SB4 KAM has been reported to have some cross-reactivity 
with L-alanine, converting it into beta-alanine. See U.S. Patent Application 
Publication No. 2003/0113882 Al. WO 03/062173 and WO 02/42418 disclose the 
first reports of AAM activity based upon modification of kam genes. In these 
applications, the synthetic aam genes had AAM activity as detected by the 
complementation of a ApanD E. coli strain. However, because alanine is not the 
natural substrate for this enzyme, the activity for this conversion is substantially less 
than the activity for conversion of lysine — its natural substrate. The AAM activity 
of a variant of B. suhtilis KAM that also had AAM activity at approximately 0.001 
U/mg. It is an object of the present invention to provide polynucleotides encoding a 
polypeptide having substantially enhanced AAM activity over that found in the wild- 
type enzymes. 
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SUMMARY OF THE INVENTION 
[08] The present invention has multiple aspects. In one aspect, the present 
invention is directed to polypeptides that catalyze the reaction of FIG. 1 . In one 
embodiment of this first aspect, the present invention is directed to a polypeptide 
having alanine 2,3-aminomutase (AAM) activity, preferably as measured by the assay 
of Example 8, and, 

(a) having a polypeptide selected from the group consisting of SEQ ID NO: 2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 and 51; 

(b) having an amino acid sequence which has at least 98% homology, with the amino 
acid sequence selected from the group consisting of SEQ ID NO: 2, 22, 28, 32, and 
36; 

(c) having an amino acid sequence which has at least 99% homology, with the amino 
acid sequence selected from the group consisting of SEQ ID NO: 4, 6, 8, 12, 16, 24, 
26, 30, 34 and 40; 

(d) being a polypeptide encoded by a nucleic acid sequence which hybridizes under 
high stringency conditions with either (i) the nucleotide sequence of SEQ ID NO: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 41, 43, 45, 47 or 49; 
(ii) a subsequence of (i) of at least 100 nucleotides, or (iii) a complementary strand of 
(i) or (ii) (J. Sambrook, E. F. Fritsch, and T. Maniatis, 1989, Molecular Cloning, A 
Laboratory Manual, 2d edition, Cold Spring Harbor, N.Y.); or 

(e) being a variant of the polypeptide of (c) comprising a substitution, deletion, and/or 
insertion of one to six amino acids therefrom and having AAM activity from about 1 
to about 30 \iM p-alanine produced /hour 1 cell OD at pH 7.0-7.6, 25°C. 

[09] Collectively, the polypeptides of (b) and (c) above are referred to herein as 
"homologous polypeptides." For purposes of the present invention, the degree of 
homology between two amino acid sequences is expressed as "percent homology," 
"percent identity," "% identity," "percent identical," and "% identical" are used 
interchangeably herein to refer to the percent amino acid sequence identity that is 
obtained by ClustalW analysis (version W 1.8 available from European 
Bioinformatics Institute, Cambridge, UK), counting the number of identical matches 
in the alignment and dividing such number of identical matches by the length of the 
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reference sequence, and using the following default ClustalW parameters to achieve 
slow/accurate pairwise optimal alignments - Gap Open Penalty: 10; Gap Extension 
Penalty:0.10; Protein weight matrix: Gonnet series; DNA weight matrix: 3UB; 
Toggle Slow/Fast pairwise alignments = SLOW or FULL Alignment. 

[10] In one embodiment, the present invention is also directed to an AAM 
polypeptide as described herein in isolated and purified form. 

[11] In another embodiment, the present invention is directed to an AAM 
polypeptide as described herein in lyophilized form. 

[12] In yet another embodiment, the present invention is directed to a composition 
comprising an AAM polypeptide as described herein and a suitable carrier, typically a 
buffer solution, more typically an aqueous buffer solution having a pH between 6.0 
and 8.0. The composition may also be in a lyophilized form. 

[13] The novel AAM polypeptides of the present invention have significantly 
enhanced AAM activity relative to the wild-type KAM polypeptides from which they 
are ultimately derived. By significantly enhanced AAM activity is meant that the 
AAM polypeptide of the present invention has an AAM activity within the range of 
about 1 to about 32 juM (3-alanine produced/hour 1 cell OD (units), preferably from 
about 10 to about 32 units, more preferably from about 20 to about 32 units; most 
preferably from about 25 to about 32 units. 

[14] Preferred AAM polypeptides of the present invention have an amino acid 
sequences of SEQ ID NOs: 2, 6, 12, 16, 20, 24, 28, 30, 32, 34, 38, 44, 46 or 48; more 
preferably they have an amino acid sequence of SEQ ID NOs: 6, 12, 28, 34, 46 or 48; 
most preferably, they have an amino acid sequence of SEQ ID NOs: 28 or 34. 

[15] One of the grandparent molecules is the KAM of Bacillus subtilis, which had 
no detectible AAM activity. The DNA encoding this grandparent molecule was 
modified as described in WO 03/062173, entitled "Alanine 2,3-aminomutase," to 
produce a polypeptide having a detectible alanine 2,3-aminomutase activity. 

[16] In the present application, the applicants utilized as one parent molecule a 
polynucleotide sequence of SEQ ID NO: 58, which encoded the 471 residue 
polypeptide of SEQ ID NO: 59 and which exhibited an AAM activity of 
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approximately .001 U/mg (units/ mg of cell mass). The molecule of SEQ ID NO: 59 
differs from the wild-type B. subtilis KAM, which had no detectible A AM activity, by 
having the following four (4) amino acid substitutions: L103M, M136V, Y140H and 
D339H. 

[17] In yet another embodiment, the present invention is directed to a polypeptide 
having from about 1 to about 32 units of AAM activity and typically varying from the 
polypeptide of SEQ ID NO: 59 by 1-7 amino acid residues, more typically by 1-6 
amino acid residues, even more typically by 1-5 amino acid residues, and most 
typically by 1-4 amino acid residues. 

[18] In its second aspect, the present invention is directed to a polynucleotide 
sequence that encodes for the correspondingly referenced AAM polypeptide. Given 
the degeneracy of the genetic code, the present invention is also directed to any 
polynucleotide that encodes for the above referenced AAM polypeptides of the 
present invention. In another preferred embodiment, the present invention is directed 
to certain specific polynucleotides of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 
21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47 and 49 that encode for the novel 
AAM polypeptides of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 48 and 51, respectively. Preferred polynucleotides 
encode for a polypeptide of SEQ ID NO: 2, 6, 12, 16, 20, 24, 28, 30, 32, 34, 38, 44, 
46 or 48; more preferably they encode a polypeptide of SEQ ID NO: 6, 12, 28, 34, 46 
or 48; most preferably, they have a polypeptide of sequence of SEQ ID NO: 28 or 34. 

[19] In a third aspect, the present invention is directed to a nucleic acid construct, a 
vector, or a host cell comprising a polynucleotide sequence encoding an AAM 
polypeptide of the present invention operatively linked to a promoter. 

[20] In a fourth aspect, the present invention is directed to a method of making an 
AAM polypeptide of the present invention comprising (a) cultivating a host cell 
transformed with a nucleic acid sequence encoding an AAM polypeptide of the 
present invention under conditions suitable for production of the polypeptide; and (b) 
providing glucose to the cultivated host cells under conditions suitable for the 
production of P-alanine, The p-alanine may be optionally recovered from the cells. 
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[21] In a fifth aspect, the present invention is directed to a method of producing b- 
alanine comprising (a) cultivating a host cell transformed with a nucleic acid sequence 
encoding an AAM polypeptide of the present invention under conditions suitable for 
production of the polypeptide; and (b) providing glucose to the cultivated host cells 
under conditions suitable for the production of b-alanine. The b-alanine may be 
optionally recovered from the cells. 
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BRIEF DESCRIPTION OF SEVERAL VIEWS OF THE DRAWINGS 
[22] FIG. 1 shows the reversible reaction between alpha-alanine {i.e., L-alanine or 
2-aminopropionic acid) and beta-alanine (3-aminopropionic acid) that is catalyzed by 
alanine 2,3-aminomutase. 

[23] FIG. 2 is a pathway for 3-hydroxypropionate (3 -HP) synthesis from alpha- 
alanine, via beta-alanine as an. intermediate. 

[24] FIG. 3 is a 4036 bp expression vector (pCKl 10900-1 Bla) of the present 
invention comprising a PI 5 A origin of replication (P15A ori), a lad repressor, a CAP 
binding site, a lac promoter (lac), a T7 ribosomal binding site (T7gl0 RBS), and a 
chloramphenicol resistance gene (camR). 

[25] FIGS. 4A-4J in combination provide an alignment chart of the amino acid 
sequences of four parental polypeptides that were used to produce the AAM of the 
present invention. The parental polypeptides were non-naturally occurring and 
derived in part from the KLAM of Clostrisium stricklandii (SEQ ID NO: 53), 
Porphyromonas gingivalis (SEQ ID NO: 55), Fusobacterium nucleatum (SEQ ID 
NO: 57), and Bacillus subtilis (SEQ ID NO: 59), respectively. The sequences of two 
wild-type KAM are disclosed in SEQ ID NOS: 60 (P GI2529467_G8_AAB81 159.1 J 

and 61 (PGI26343 61_EMB CAB 1 3 860 A_J. A consensus sequence is also provided 

as SEQ ID NO: 62). 

[26] The foregoing summary, as well as the following detailed description of 
certain embodiments of the present invention, will be better understood when read in 
conjunction with the appended drawings. For the purpose of illustrating the 
invention, there is shown in the drawings, certain embodiments. It should be 
understood, however, that the present invention is not limited to the arrangements and 
instrumentality shown in the attached drawings. 
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DETAILED DESCRIPTION OF THE INVENTION 
[27] The present invention has multiple aspects. In one aspect, the present 
invention is directed to a polypeptide having alanine 2,3-aminomutase (AAM) 
activity, preferably as measured by the assay of Example 8, and 

(a) having a polypeptide selected from the group consisting of SEQ ID NO: 2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 and 51: 

(b) having an amino acid sequence which has at least 98% homology, with the amino 
acid sequence selected from the groxip consisting of SEQ ID NO: 2, 22, 28, 32, and 
36; 

(c) having an amino acid sequence which has at least 99% homology, with the amino 
acid sequence selected from the groxip consisting of SEQ ID NO: 4, 6, 8, 12, 16, 24, 
26, 30, 34 and 40; 

(d) being a polypeptide encoded by a nucleic acid sequence which hybridizes under 
high stringency conditions with either (i) the nucleotide sequence of SEQ ID NO: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37,41,43, 45, 47 or 49; 
(ii) a subsequence of (i) of at least 1 00 nucleotides, or (iii) a complementary strand of 
(i) or (ii) (J. Sambrook, E. F. Fritsch, and T. Maniatis, 1989, Molecular Cloning, A 
Laboratory Manual, 2d edition, Cold Spring Harbor, N.Y.); or 

(e) being a variant of the polypeptide of (d) comprising a substitution, deletion, and/or 
insertion of one to six amino acids therefrom and having AAM activity from about 1 
to about 30 juM p-alanine produced /hour 1 cell OD atpH 7.0-7.6, 25°C. 

[28] Collectively, the polypeptides of (b) and (c) above are referred to herein as 
"homologous polypeptides." For purposes of the present invention, the degree of 
homology between two amino acid, sequences is expressed as "percent homology," 
"percent identity," "% identity," "percent identical," and "% identical" are used 
interchangeably herein to refer to the percent amino acid sequence identity that is 
obtained by ClustalW analysis (version W 1.8 available from European 
Bioinformatics Institute, Cambridge, UK), counting the number of identical matches 
in the alignment and dividing such number of identical matches by the length of the 
reference sequence, and using the following default ClustalW parameters to achieve 
slow/accurate pairwise optimal alignments - Gap Open Penalty: 10; Gap Extension 
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Penalty:0.10; Protein weight matrix: Gonnet series; DNA weight matrix: IUB; 
Toggle Slow/Fast pairwise alignments = SLOW or FULL Alignment. 

[29] AAM polypeptides are sensitive to oxygen and are preferably maintained and 
used in an oxygen deficient environment. If the AAM polypeptide becomes 
inactivated due to exposure to oxygen, it can be activated by anaerobic incubation 
with a sulfhydryl compound for one hour at 37°C in accordance with the method 
described in Chirpich, et al., Journal Biol. Chem., 245(7): 1778-1789 (1970), which is 
incorporated herein by reference in its entirety. AAM polypeptides of the present 
invention are preferably utilized in whole cell form (i.e., as a whole cell transformed 
with an AAM polynucleotide that is used under conditions such that the encoded 
AAM polypeptide is expressed in the cell) or alternatively, both isolated and utilized 
under anoxic conditions. AAM polypeptides of the present invention may be isolated, 
and optionally purified, under anaerobic conditions (e.g., under a nitrogen 
atmosphere) in accordance with the method described in Petrovich, et al., Journal 
Biol. Chem., 266(12):7656-7660 (1991), which describes the isolation and 
purification of lysine-2,3-aminomutase and which is incorporated herein by reference 
in its entirety. As used herein, the term "anoxic" refers to oxygen deficient. The 
AAM polypeptides in whole cell form or as isolated enzymes may be lyophilized. In 
yet another embodiment, the present invention is directed to a composition 
comprising an AAM polypeptide as described herein (e.g., in whole cell form or as an 
isolated polypeptide) and a suitable carrier, typically a buffer, more typically an 
aqueous buffer solution having a pH from about 6.0 to about 8.0. It is also within the 
scope of the present invention that the aqueous buffered composition be lyophilized to 
provide a composition in a lyophilized form, wherein the composition is reconstituted 
by the addition of an aqueous based composition. 

[30] In one embodiment, the present invention is also directed to an AAM 
polypeptide as described herein in isolated and purified form. 

[31] In another embodiment, the present invention is directed to an AAM 
polypeptide as described herein in lyophilized form. Lyophilization is performed 
using standard lyophilization equipment. Typically, a solution containing the 
polypeptide is dispensed in an appropriate sized vial, frozen and placed under reduced 
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pressure to cause the water to evaporate, leaving the lyophilized (freeze-dried) 
polypeptide behind. Prior to use, the lyophilized polypeptide is reconstituted with 
distilled water or an appropriate buffer solution. 

[32] In yet another embodiment, the present invention is directed to a composition 
comprising an AAM polypeptide as described herein and a suitable carrier, typically a 
buffer solution, more typically an aqueous buffer solution having a pH between 6.0 
and 8.0. The composition may also be in a lyophilized form. 

[33] The novel AAM polypeptides of the present invention have significantly 
enhanced AAM activity relative to the wild-type KAM polypeptides from which they 
are ultimately derived. By significantly enhanced AAM activity is meant that the 
AAM polypeptide of the present invention has an AAM activity within the range of 
about 1 to about 32 juM P-alanine produced/hour 1 cell OD (units), preferably from 
about 10 to about 32 units, more preferably from about 20 to about 32 units; most 
preferably from about 25 to about 32 units. 

[34] Table 1 provides a chart showing the AAM activities of the various AAM 
polypeptides of the present invention, identified by their clone number and SEQ ID 
NO. In Table 1, the OD 60 0nm is reported at harvest after 5 hours (t=5) of incubation. 
Table 1 also reports the total |uM of p-alanine produced after 5 hours per 1 cell OD. 
Finally, the last column of Table 1 reports the rate of p-alanine (|llM) produced/hr /l 
cell OD. 
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Table 1 
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32 
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17.5 
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6 
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[35] Preferred AAM polypeptides of the present invention have an amino acid 
sequences of SEQ ID NOs: 2, 6, 12, 16, 20, 24, 28, 30, 32, 34, 38, 44, 46 or 48; more 
preferably they have an amino acid sequence of SEQ ID NOs: 6, 12, 28, 34, 46 or 48; 
most preferably, they have an amino acid sequence of SEQ ID NOs: 28 or 34. 

[36] The ultimate grandparent molecule is the KAM of Bacillus subtilis, which had 
no detectible AAM activity. The DNA encoding this grandparent molecule was 
modified as described in WO 03/062173, entitled "Alanine 2,3-aminomutase," to 
produce a polypeptide having a detectible alanine 2,3-aminomutase activity. 
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[37] In the present application, the applicants utilized as one parent molecule a 
polynucleotide of SEQ ID NO: 58, which encoded the 471 residue polypeptide of 
SEQ ID NO: 59 and which exhibited an AAM activity of approximately .001 U/mg 
(units/ mg of cell mass). The molecule of SEQ ID NO: 59 differs from the wild-type 
B. subtilis KAM (SEQ ID NO: 60), which had no detectible AAM activity, having 
the following four (4) amino acid substitutions: L103M, M136V, Y140H and D339H. 

[38] Other grandparent molecules utilized as starting materials in the present 
invention were the DNA sequences from other microorganisms (e.g., Porphyromonas 
gingivalis, Fusobacterium nucleatum, and Clostridium sticklandii) that erxcoded a 
KAM polypeptide. These DNA sequences were modified using standard techniques 
to introduce point substitutions that ultimately produced a KAM polypeptide that also 
had a detectible cross-reactivity with a-alanine. One such parent molecule that was 
derived from Porphyromonas gingivalis is the polynucleotide of SEQ ID NO: 54 
which encodes the 416 residue polypeptide of SEQ ID NO: 55. The parental 
polypeptide of SEQ ID NO: 55 differs from the wild-type Porphyromonas gingivalis 
KAM by having the following seven (7) amino acid substitutions: N19\T, E30K, 
L53P, H85Q, I192V, D331G, and M342T. Another such parent molecule that was 
derived from F. nucleatum is the polynucleotide of SEQ ID NO: 56 which encodes 
the 425 residue polypeptide of SEQ ID NO: 57. 

[39] Yet another parent polynucleotide was derived by modification of the 
polynucleotide in C. stricklandii that encodes KAM. The resulting parental 
polynucleotide, which has a detectable cross-reactivity with a-alanine,, is the 
polynucleotide of SEQ ID NO: 52 which encodes the 416 residue polypeptide of SEQ 
ID NO: 53. 

[40] The above described parental polypeptides of SEQ ID NOs: 53, 55, 57 and 58 
are compared in the alignment chart of FIG. 4. From the alignment chart, it can be 
seen that the KAMs from P. gingivalis, C. stricklandii, and F. nucleatum are truncated 
at the N-terminus and at the C-terminus relative to the KAM from B. subtilis, while 
between the four species, about 40% of the residue positions in the central portion of 
the KAM polypeptide are conserved. Based upon the truncated species in the 
alignment chart of FIG. 4, it can be inferred that the first 8 amino acid residues at the 
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N-terminus of SEQ ID NO: 58 and the last 40 residues at the C-terminus of SEQ ID 
NO: 58 are not necessary for KAM activity, or the AAM activity that is derived 
therefrom. In FIG. 4, there is also provided a consensus sequence. 

[41] The AAM polypeptide molecules of the present invention with their enhanced 
AAM activity were made by applying directed evolution techniques to the above- 
described parental molecules. These techniques are described in further detail herein. 

[42] In yet another aspect, the present invention is directed to AAM polypeptides 
that have enhanced activity in coupled reactions. 

[43] In another embodiment, the present invention is directed to an AAM a 
polypeptide encoded by a nucleic acid sequence which hybridizes under high 
stringency conditions with either (i) the nucleotide sequence of SEQ ID NO: 1, 3, 5, 
7, 9, 11, 13, 15, 17, 19, 21, 23,25,27,29, 31, 33, 35, 37,41,43,45,47 or 49; (ii) a 
subsequence of (i) of at least 100 nucleotides, or (iii) a complementary stran<i of (i) or 
(ii) (J. Sambrook, E. F. Fritsch, and T. Maniatis, 1989, Molecular Cloning, A 
Laboratory Manual, 2d edition, Cold Spring Harbor, N.Y.). For polynucleotides of at 
least 100 nucleotides in length, low to very high stringency conditions are defined as 
prehybridization and hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 jag/ml sheared 
and denatured salmon sperm DNA, and either 25% formamide for low stringencies, 
35% formamide for medium and medium-high stringencies, or 50% formamide for 
high and very high stringencies, following standard Southern blotting procedures. 

[44] For polynucleotides of at least 100 nucleotides in length, the carrier material is 
finally washed three times each for 15 minutes using 2x SSC, 0.2% SDS at least at 
50°C (low stringency), at least at 55°C (medium stringency), at least at 60°C. 
(medium-high stringency), at least at 65°C (high stringency), and at least at 70°C. 
(very high stringency). 

[45] In another embodiment, the present invention is directed to a variant of the 
polypeptide of (d) comprising a substitution, deletion, and/or insertion of one to six 
amino acids there-from and having AAM activity from about 1 to about 30 pM P- 
alanine produced /hour 1 cell OD at pH 7.0-7.6, 25°C, such as determined by the 
method of Example 8. Preferably, amino acid changes are of a minor natuLre, that is 
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conservative amino acid substitutions that do not significantly affect the folding 
and/or activity of the protein; small deletions, typically of one to si;x amino acids; 
small amino- or carboxyl-terminal extensions; a small linker peptide; or a small 
extension that facilitates purification by changing net charge or another function, such 
as a poly-histidine tract, an antigenic epitope or a binding domain. 

[46] Examples of conservative substitutions are within the group of basic amino 
acids (arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic 
acid), polar amino acids (glutamine and asparagine), hydrophobic amino acids 
(leucine, isoleucine and valine), aromatic amino acids (phenylalanine, tryptophan and 
tyrosine), and small amino acids (glycine, alanine, serine, threonine, proline, cysteine 
and methionine). Amino acid substitutions, which do not generally alter the specific 
activity are known in the art and are described, for example, by H. Nemrath and R. L. 
Hill, 1979, In, The Proteins, Academic Press, New York. The most commonly 
occurring exchanges are Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, 
Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ile, Leu/Val, 
Ala/Glu, and Asp/Gly as well as these in reverse. 

[47] In another embodiment, the present invention is directed to a fragment of (a), 
(b) or (c), as described above in the first paragraph of the Detailed Description, that 
has from about 1 to about 30 jliM p-alanine produced /hour 1 cell OE> at pH 7.0-7.6, 
25°C, such as determined by the method of Example 8. By the term "fragment" is 
meant that the polypeptide has a deletion of 1 to 8 amino acid residixes from the N- 
terminus or 1-40 residues from the C-terminus, or both. Preferably, tlie deletion is 1 
to 20 residues from the C-terminus, more preferably, the deletion is 1 to 10 residues 
from the C-terminus. 

Polynucleotides 

[48] In its second aspect, the present invention is directed to a polynucleotide 
sequence that encodes for an AAM polypeptide of the present invention. Given the 
degeneracy of the genetic code, the present invention is also directed to any 
polynucleotide that encodes for the above referenced AAM polypeptides of the 
present invention. In its second aspect, the present invention is directed to a 
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polynucleotide sequence that encodes for the correspondingly referenced AAM 
polypeptide. Given the degeneracy of the genetic code, the present invention is also 
directed to any polynucleotide that encodes for the above referenced AAM 
polypeptides of the present invention. In a preferred embodiment, the present 
invention is directed to certain specific polynucleotides of SEQ ID NOS: 1, 3, 5, 7, 9, 
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47 and 49 that 
encode for the novel AAM polypeptides of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 and 51, respectively. 
Preferred polynucleotides encode for a polypeptide of SEQ ID NO: 2, 6, 12, 16, 20, 
24, 28, 30, 32, 34, 38, 44, 46 or 48; more preferably they encode a polypeptide of 
SEQ ID NO: 6, 12, 28, 34, 46 or 48; most preferably, they have a polypeptide of 
sequence of SEQ ID NO: 28 or 34. 

[49] To make the improved AAM polypeptides of the present invention, one starts 
with one or more wild-type polynucleotides that encode a KAM polypeptide. The 
term "wild-type" polynucleotide means that the nucleic acid fragment does not 
comprise any mutations from the form isolated from nature. The term "wild-type" 
protein means that the protein will be active at a level of activity found in nature and 
typically will comprise the amino acid sequence as found in nature. Thus, the term 
"wild type" or "grand-parent sequence" indicates a starting or reference sequence 
prior to a manipulation of the invention. 

[50] Suitable sources of wild-type KAM as a starting material to be improved is 
readily identified by screening genomic libraries for the KAM activity. A particularly 
suitable source of KAM is the yodO gene of Bacillus sp. bacteria as found in nature. 
Using the published KAM gene sequences for B. subtilis {e.g., WO 03 0623173 A2), 
primers for amplification of the genes from their respective gene libraries were 
created using conventional techniques. One such technique for isolating the KAM of 
B. subtilis is disclosed in Chen et ah, "A novel lysine 2,3-aminomutase encoded by 
the yodO gene of Bacillus subtilis: characterization on observation of organic radical 
intermediates," Biochem J. 348:539-549 (2000), which is incorporated herein by 
reference. 
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[51] The starting polynucleotides of SEQ ID NOs: 52, 54, 56 and 58 were obtained 
using the techniques discloses in WO 03 0623173 A2 which is incorporated herein by 
reference for the disclosure of those techniques as recited in the examples therein. 
Specifically, WO 03 0623173 A2 discloses a B. suhtilis wild-type lysine 2,3- 
aminomutase (KAM), and a mutated form thereof, which encodes an alanine 2,3- 
aminomutase (AAM). In addition, WO 03 0623173 A2 also discloses a P. gingivalis 
wild-type lysine 2,3-aminomutase (KAM) and a mutated form thereof, which encodes 
an alanine 2,3-aminomutase (AAM). 

[52] Beginning with the polynucleotide of SEQ ID NO: 58, a non-naturally 
occurring and mutated and/or evolved enzyme, having unknown AAM activity is 
generated using any one of the well-known mutagenesis or directed evolution 
methods. See, e.g., Ling, et al., "Approaches to DNA mutagenesis: an overview," 
Anal. Biochem. , 254(2): 157-78 (1997); Dale, et al., "Oligonucleotide-directed 
random mutagenesis using the phosphorothioate method," Methods Mol. Biol. , 
57:369-74 (1996); Smith, "In vitro mutagenesis," Ann. Rev. Genet. , 19:423-462 

(1985) ; Botstein, et al., " Strategies and applications of in vitro mutagenesis," Science , 
229:1193-1201 (1985); Carter, "Site-directed mutagenesis," Biochem. J. , 237:1-7 

(1986) ; Kramer, et al., "Point Mismatch Repair," Cell, 38:879-887 (1984); Wells, et 
al., "Cassette mutagenesis: an efficient method for generation of multiple mutations at 
defined sites," Gene , 34:315-323 (1985); Minshull, et al., "Protein evolution by 
molecular breeding," Current Opinion in Chemical Biology , 3:284-290 (1999); 
Christians, et al., "Directed evolution of thymidine kinase for AZT phosphorylation 
using DNA family shuffling," Nature Biotechnology , 17:259-264 (1999); Crameri, et 
al., "DNA shuffling of a family of genes from diverse species accelerates directed 
evolution," Nature , 391:288-291; Crameri, et al., "Molecular evolution of an arsenate 
detoxification pathway by DNA shuffling," Nature Biotechnology , 15:436-438 
(1997); Zhang, et al., "Directed evolution of an effective fucosidase from a 
galactosidase by DNA shuffling and screening," Proceedings of the National 
Academy of Sciences, U.S.A. , 94:45-4-4509; Crameri, et al., "Improved green 
fluorescent protein by molecular evolution using DNA shuffling," Nature 
Biotechnology < 14:315-319 (1996); Stemmer, "Rapid evolution of a protein in vitro 
by DNA shuffling," Nature , 370:389-391 (1994); Stemmer, "DNA shuffling by 
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random fragmentation and reassembly: In vitro recombination for molecular 
evolution," Proceedings of the National Academy of Sciences, "U.S.A. , 91:10747- 
10751 (1994); WO 95/22625; WO 97/0078; WO 97/35966; 98/27230; WO 

00/42651; WO 01/75767 and U.S. Pat. 6,537,746 which issued to Arnold, et al. on 
March 25, 2003 and is entitled "Method for creating polynucleotide and polypeptide 
sequences." 

[53J Any of these methods can be applied to generate AAM polynucleotides. To 
maximize any diversity, several of the above-described techniques can be used 
sequentially. Typically, a library of shuffled polynucleotides is created by one 
mutagenic or evolutionary technique and their expression products are screened to 
find the polypeptides having the highest AAM activity. Then, a second mutagenic or 
evolutionary technique is applied to polynucleotides encoding the most active 
polypeptides to create a second library, which in turn is screened for AAM activity by 
the same technique. The process of mutating and screening can be repeated as many 
times as needed, including the insertion of point mutations, to arrive at a 
polynucleotide that encodes a polypeptide with the desired activity, thermostability, or 
cofactor preference. 

[54] Alternatively, polynucleotides and oligonucleotides of the? invention can be 
prepared by standard solid-phase methods, according to known synthetic methods. 
Typically, fragments of up to about 100 bases are individually synthesized, then 
joined (e.g., by enzymatic or chemical litigation methods, or polymerase mediated 
methods) to form essentially any desired continuous sequence. For example, 
polynucleotides and oligonucleotides of the invention can be prepared by chemical 
synthesis using, e.g., the classical phosphoramidite method described by Beaucage et 
al. (1981) Tetrahedron Letters 22:1859-69, or the method described by Matthes et al. 
(1984) EMBO J. 3:801-05, e.g., as it is typically practiced in automated synthetic 
methods. According to the phosphoramidite method, oligonucleotides are 
synthesized, e.g., in an automatic DNA synthesizer, purified, annealed, ligated and 
cloned in appropriate vectors. 

[55] In addition, essentially any nucleic acid can be custom ordered from any of a 
variety of commercial sources, such as The Midland Certified Reagent Company, 
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Midland, TX, The Great American Gene Company, Ramona, CA, ExpressGen Inc., 
Chicago, IL, Operon Technologies Inc., Alameda, CA, all of which have internet web 
sites, and many others. Similarly, peptides and antibodies can be custom ordered 
from any of a variety of sources, such as PeptidoGenic, HTI Bio-products, Inc., BMA 
Biomedicals Ltd. (U.K.), Bio. Synthesis, Inc., and many others. 

[56] Polynucleotides may also be synthesized by well-known tecliniques as 
described in the technical literature. See, e.g., Carruthers et al, Cold Spring Harbor 
Symp. Quant Biol. 47:411-418 (1982), and Adams et al, J. Am. Chem. Soc. 105:661 
(1983). Double stranded DNA fragments may then be obtained either by synthesizing 
the complementary strand and annealing the strands together under appropriate 
conditions, or by adding the complementary strand using DNA polymerase with an 
appropriate primer sequence. 

[57] General texts which describe molecular biological techniques useful herein, 
including mutagenesis, include Berger and Kimmel, Guide to Molecular Cloning 
Techniques, Methods in Enzymology , volume 152 Academic Press, Inc., San Diego, 
CA ("Berger"); Sambrook et al 9 Molecular Cloning - A Laboratory Manual (2nd Ed.), 
volumes 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989 
("Sambrook"); and Current Protocols in Molecular Biology , F.M. AusubeZ et al 9 eds., 
Current Protocols, a joint venture between Greene Publishing Associates, Inc. and 
John Wiley & Sons, Inc. (supplemented through 2000) ("Ausubel")). Examples of 
techniques sufficient to direct persons of skill through in vitro amplification methods, 
including the polymerase chain reaction (PGR) the ligase chain reaction (LCR), QP~ 
replicase amplification and other RNA polymerase mediated techniques (e.g., 
NASBA) are found in Berger, Sambrook, and Ausubel, as well as Mullis ezt al 9 (1987) 
U.S. Patent No. 4,683,202; PGR Protocols A Guided to Methods and Applications 
(Innis et al, eds.) Academic Press Inc. San Diego, CA (1990); Arnheim & Levinson 
(October 1, 1990) Chemical and Engineering News 36-47; The Journal Of NIH 
Research (1991) 3:81-94; Kwoh et al (1989) Proc. NatL Acad. Sci. USA 86:1173; 
Guatelli et al (1990) Proc. NatL Acad. Sci. USA 87:1874; Lomell et al. (1989) J. 
Clin. Chem. 35:1826; Landegren et al, (1988) Science 241:1077-1080; Van Brunt 
(1990) Biotechnology 8:291-294; Wu and Wallace, (1989) Gene 4:560; JBarringer et 
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al (1990) Gene 89:117, and Sooknanan and Malek (1995) Biotechnology 13:563- 
564. Improved methods of cloning in vitro amplified nucleic acids are described in 
Wallace et al, U.S. Pat. No. 5,426,039. Improved methods of amplifying large 
nucleic acids by PCR are summarized in Cheng et ah (1994) Nature 369:684-685 and 
the references therein, in which PCR amplicons of up to 40kb are generated. One of 
skill will appreciate that essentially any RNA can be converted into a double stranded 
DNA suitable for restriction digestion, PCR expansion and sequencing using reverse 
transcriptase and a polymerase. See, Ausubel, Sambrook and Berger, all supra. 

[58] It will be appreciated by those skilled in the art due to the degeneracy of the 
genetic code, a multitude of nucleotide sequences encoding AAM polypeptides of the 
invention may be produced, some of which bear substantial identity to the nucleic 
acid sequences explicitly disclosed herein. It is also within the scope of the present 
invention that the polynucleotides encoding the AAM polypeptides of the present 
invention may be codon optimized for optimal production from the host organism 
selected for expression. Those having ordinary skill in the art will recognize that 
tables and other references providing codon preference information for a wide range 
of organisms are readily available. See e.g., Henaut and Danchin, "Escherichia coli 
and Salmonella;' Neidhardt, et al. Eds., ASM Press, Washington D.C., p. 2047-2066 
(1996). 

[59] It is to be noted that expression in E. coli is different than in other organisms. 
For example, in the present invention, the codon (tgg) encodes Trp (W) for residue 
position 31 in the parent polypeptide of SEQ ID NO: 59. However, the corresponding 
codon for residue position 3 1 is "tga" in each of the progeny polynucleotides of SEQ 
ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 
45, and 47 encoding for the AAM polypeptides of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, and 48, respectively. One 
skilled in the art recognizes that the codon "tga" is usually a stop (nonsense) codon. 
However, in the present expression system used in the ApanD E. coli strain, and under 
the selection conditions imposed, this codon is read through by the E. coli as a sense 
codon and is expressed, presumably as Trp (W). Others have reported that "tga" is 
the weakest stop codon for E. coli and that it is often read through as a sense codon 
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for Trp (W) in high expression. See e.g., Parker, J., "Errors and Alternatives in 
Reading the universal Genetic Code," Microbiological Reviews, 53(3): 273-298 
(1989); Roth, J., "UGA Nonsense Mutations in Salmonella typhimurium" J. of 
Bacteriology, 102(2):467~475 (1970); and McBeath, G. and Kast, P., "UGA Read- 
Through Artifacts — When Popular Gene Expression Systems Need a Patch," 
BioTechniques, 24:789-794 (May 1998), which are incorporated herein by reference. 
Hence for expression in non-i?. coli systems, it would be advantageous to alter the 
codon (tga) at residue position 31 to "tgg" which is tlxe universal sense codon for Trp 
(W). 

[60] In SEQ ID NO: 49, the codon encoding for residue 72 is "tag" which is read as 
a stop codon. However, two fragments are produced. The first fragment, having 
residues 1-71 of SEQ ID NO: 50, does not have any detectable AAM activity. The 
second fragment that is produced begins with residixe 73 (Val) instead of the usual 
Met. This second fragment has 399 residues (SEQ ID NO: 51) and does have 
significant AAM activity (see Table 2) based upon the assay of Example 8. Thus, the 
first 72 residues at the N-terminus of the AAM polypeptide (based upon the 
consensus sequence or the parental KAM sequence from B. subtilis) are not 
absolutely necessary for AAM activity. 

[61] In the present case, several round No. 1 libraries were created by applying a 
variety of mutagenic techniques to the polynucleotides of SEQ ID NOs: 52, 54, 56 
and 58. 

[62] In its third aspect, the present invention is directed to an expression vector and 
to a host cell comprising a polynucleotide of the present invention operatively linked 
to a control sequence. To obtain expression of the ^variant gene encoding an AAM 
polypeptide, the variant gene was first operatively lirLked to one or more heterologous 
regulatory sequences that control gene expression to create a nucleic acid construct, 
such as an expression vector or expression cassette. Thereafter, the resulting nucleic 
acid construct, such as an expression vector or expression cassette, was inserted into 
an appropriate host cell for ultimate expression of tlxe AAM polypeptide encoded by 
the shuffled gene. A "nucleic acid construct" is defined herein as a nucleic acid 
molecule, either single-or double-stranded, which, is isolated from a naturally 



WO 2006/047589 



PCT/US2005/038552 



-22- 

occurring gene or which has been modified to contain segments of nucleic acid 
combined and juxtaposed in a manner that would not otherwise exist in nature. Thus, 
in one aspect, the present invention is directed to a nucleic acid construct comprising 
a polynucleotide encoding an AAM polypeptide of the present invention. 

[63] The term "nucleic acid construct" is synonymous with the term "expression 
cassette" when the nucleic acid construct contains all the control sequences required 
for expression of a coding sequence of the present invention. The term "coding 
sequence" is defined herein as a nucleic acid sequence, which directly specifies the 
amino acid sequence of its protein product. A coding sequence can include, but is not 
limited to, DNA, cDNA, and recombinant nucleic acid sequences. 

[64] An isolated polynucleotide encoding an AAM polypeptide of the present 
invention may be manipulated in a variety of ways to provide for expression of the 
polypeptide. Manipulation of the isolated polynucleotide prior to its insertion into a 
vector may be desirable or necessary depending on the expression vector. The 
techniques for modifying polynucleotides and nucleic acid sequences utilizing 
recombinant DNA methods are well known in the art. 

[65] The term "control sequence" is defined herein to include all components, 
which are necessary or advantageous for the expression of a polypeptide of the 
present invention. Each control sequence may be native or foreign to the nucleic acid 
sequence encoding the polypeptide. Such control sequences include, but are not 
limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal 
peptide sequence, and transcription terminator. At a minimum, the control sequences 
include a promoter, and transcriptional and translational stop signals. The control 
sequences may be provided with linkers for the purpose of introducing specific 
restriction sites facilitating ligation of the control sequences with the coding region of 
the nucleic acid sequence encoding a polypeptide. 

[66] The term "operably linked" is defined herein as a configuration in which a 
control sequence is appropriately placed at a position relative to the coding sequence 
of the DNA sequence such that the control sequence directs the expression of a 
polypeptide. 
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[67] The control sequence may be an appropriate promoter sequence. The 
"promoter sequence" is a relatively short nucleic acid sequence that is recognized by a 
host cell for expression of the longer coding region that follows. The promoter 
sequence contains transcriptional control sequences, which mediate the expression of 
the polypeptide. The promoter may be any nucleic acid sequence which shows 
transcriptional activity in the host cell of choice including mutant, truncated, and 
hybrid promoters, and may be obtained from genes encoding extracellular or 
intracellular polypeptides either homologous or heterologous to the host cell. 

[68] For bacterial host cells, suitable promoters for directing the transcription of the 
nucleic acid constructs of the present invention, include the promoters obtained from 
the E. coli lac operon , Streptomyces coelicolor agarase gene (dagA), Bacillus subtilis 
levansucrase gene (sacB), Bacillus licheniformis alpha-amylase gene (arnyL), Bacillus 
stearothermophihis maltogenic amylase gene (amyM), Bacillus amyloliqiiefaciens 
alpha-amylase gene (amyQ), Bacillus licheniformis penicillinase gene (penP), 
Bacillus subtilis xylA and xylB genes, and prokaryotic beta-lactamase gene (Villa- 
Kamaroff et al., 1978, Proceedings of the National Academy of Sciences USA 75: 
3727-3731), as well as the tac promoter (DeBoer et al., 1983, Proceedings of the 
National Academy of Sciences USA 80: 21-25). Further promoters are described in 
"Useful proteins from recombinant bacteria" in Scientific American, 1980, 242: 74- 
94; and in Sambrook et al., 1989, supra. 

[69] For filamentous fungal host cells, suitable promoters for directing the 
transcription of the nucleic acid constructs of the present invention include promoters 
obtained from the genes for Aspergillus oiyzae TAKA amylase, Rhizomucor miehei 
aspartic proteinase, Aspergillus niger neutral alpha-amylase, Aspergillus niger acid 
stable alpha-amylase, Aspergillus niger or Aspergillus awamori glucoarnylase (glaA), 
Rhizomucor miehei lipase, Aspergillus oryzae alkaline protease, Aspergillus oryzae 
triose phosphate isomerase, Aspergillus nidulans acetamidase, and Fusarium 
oxysporum trypsin-like protease (WO 96/00787), as well as the NA2-tpi promoter (a 
hybrid of the promoters from the genes for Aspergillus niger neutral alpha-amylase 
and Aspergillus oryzae triose phosphate isomerase), and mutant, truncated, and hybrid 
promoters thereof. 
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[70] In a yeast host, useful promoters are obtained from the genes for 
Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae galactokinase 
(GAL1 ), Saccharomyces cerevisiae alcohol dehydrogenase/ glyceraldehy de-3 - 
phosphate dehydrogenase (ADH2/GAP), and Saccharomyces cerevisiae 3- 
phosphoglycerate kinase. Other useful promoters for yeast host cells are described by 
Romanos et al., 1992, Yeast 8:423-488. 

[71] The control sequence may also be a suitable transcription terminator sequence, 
a sequence recognized by a host cell to terminate transcription. The terminator 
sequence is operably linked to the 3' terminus of the nucleic acid sequence encoding 
the polypeptide. Any terminator, which is functional in the host cell of choice, may be 
used in the present invention. 

[72] Preferred terminators for filamentous fungal host cells are obtained from the 
genes for Aspergillus oiyzae TAKA amylase, Aspergillus niger glucoamylase, 
Aspergillus nidulans anthranilate synthase, Aspergillus niger alpha-glucosidase, and 
Fusarium oxysporum trypsin-like protease. 

[73] Preferred terminators for yeast host cells are obtained from the genes for 
Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), 
and Saccharomyces cerevisiae glyceraldehyde-3 -phosphate dehydrogenase. Other 
useful terminators for yeast host cells are described by Romanos et al., 1992, supra. 

[74] The control sequence may also be a suitable leader sequence, a nontran slated 
region of an mRNA which is important for translation by the host cell. The leader 
sequence is operably linked to the 5' terminus of the nucleic acid sequence encoding 
the polypeptide. Any leader sequence that is functional in the host cell of choice may 
be used in the present invention. Preferred leaders for filamentous fungal host cells 
are obtained from the genes for Aspergillus oryzae TAKA amylase and Aspergillus 
nidulans triose phosphate isomerase. Suitable leaders for yeast host cells are obtained 
from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces 
cerevisiae 3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha- factor, and 
Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate 
dehydrogenase (ADH2/GAP). 
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[75] The control sequence may also be a polyadenylation sequence, a sequence 
operably linked to the 3' terminus of the nucleic acid sequence and which, when 
transcribed, is recognized by the host cell as a signal to add polyadenosine residues to 
transcribed mRNA. Any polyadenylation sequence that is functional in the host cell 
of choice may be used in the present invention. Preferred polyadenylation sequences 
for filamentous fungal host cells are obtained from the genes for Aspergillus oryzae 
TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate 
synthase, Fusarium oxysporum trypsin-like protease, and Aspergillus niger alpha- 
glucosidase. Useful polyadenylation sequences for yeast host cells are described by 
Guo and Sherman, 1995, Molecular Cellular Biology 15: 5983-5990. 

[76] The control sequence may also be a signal peptide coding region that codes for 
an amino acid sequence linked to the amino terminus of a polypeptide and directs the 
encoded polypeptide into the cell's secretory pathway. The 5' end of the coding 
sequence of the nucleic acid sequence may inherently contain a signal peptide coding 
region naturally linked in translation reading frame with the segment of the coding 
region that encodes the secreted polypeptide. Alternatively, the 5* end of the coding 
sequence may contain a signal peptide coding region that is foreign to the coding 
sequence. The foreign signal peptide coding region may be required where the coding 
sequence does not naturally contain a signal peptide coding region. 

[77] Alternatively, the foreign signal peptide coding region may simply replace the 
natural signal peptide coding region in order to enhance secretion of the polypeptide. 
However, any signal peptide coding region that directs the expressed polypeptide into 
the secretory pathway of a host cell of choice may be used in the present invention. 

[78] Effective signal peptide coding regions for bacterial host cells are the signal 
peptide coding regions obtained from the genes for Bacillus NC1B 11837 maltogenic 
amylase, Bacillus stearothermophilus alpha-amylase, Bacillus licheniformis subtilisin, 
Bacillus licheniformis beta-lactamase, Bacillus stearothermophilus neutral proteases 
(nprT, nprS, nprM), and Bacillus subtilis prsA. Further signal peptides are described 
by Simonen and Palva, 1993, Microbiological Reviews 57: 109-137. 

[79] Effective signal peptide coding regions for filamentous fungal host cells are 
the signal peptide coding regions obtained from the genes for Aspergillus oryzae 
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TAKA amylase, Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, 
Rhizomucor miehei aspartic proteinase, Humicola insolens cellulase, and Humicola 
lanuginosa lipase. 

[80] Useful signal peptides for yeast host cells are obtained from the genes for 
Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. 
Other useful signal peptide coding regions are described by Romanes et ah, 1992, 
supra. 

[81] The control sequence may also be a propeptide coding region that codes for an 
amino acid sequence positioned at the amino terminus of a polypeptide. The resultant 
polypeptide is known as a proenzyme or prop olypep tide (or a zymogen in some 
cases). A propolypeptide is generally inactive and can be converted to a mature active 
polypeptide by catalytic or autocatalytic cleavage of the propeptide from the 
propolypeptide. The propeptide coding region may be obtained from the genes for 
Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), 
Saccharomyces cerevisiae alpha-factor, Rhizomucor miehei aspartic proteinase, and 
Myceliophthora thermophila lactase (WO 95/33836). 

[82] Where both signal peptide and propeptide regions are present at the amino 
terminus of a polypeptide, the propeptide region is positioned next to the amino 
terminus of a polypeptide and the signal peptide region is positioned next to the amino 
terminus of the propeptide region. 

[83] It may also be desirable to add regulatory sequences, which allow the 
regulation of the expression of the polypeptide relative to the growth of the host cell. 
Examples of regulatory systems are those which cause the expression of the gene to 
be turned on or off in response to a chemical or physical stimulus, including the 
presence of a regulatory compound. In prokaryotic host cells, suitable regulatory 
sequences include the lac, tac, and trp operator systems. In yeast host cells, suitable 
regulatory systems include the ADH2 system or GAL1 system. In filamentous fungi, 
suitable regulatory sequences include the TAKA alpha-amylase promoter, Aspergillus 
niger glucoamylase promoter, and Aspergillus oryzae glucoamylase promoter. 
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[84] Other examples of regulatory sequences are those which allow for gene 
amplification. In eukaryotic systems, these include the dihydrofolate reductase gene, 
which is amplified in the presence of methotrexate, and tine metallothionein genes, 
which are amplified with heavy metals. In these cases, tlie nucleic acid sequence 
encoding the AAM polypeptide of the present invention would be operably linked 
with the regulatory sequence. 

Expression Vectors 

[85] In another aspect, the present invention is also directed to a recombinant 
expression vector comprising a polynucleotide of the present invention (which 
encodes an AAM polypeptide of the present invention), aixcl one or more expression 
regulating regions. An expression regulating region includes a promoter, a 
terminator, a replication origin, etc., depending on the type? of hosts into which they 
are to be introduced. The various nucleic acid and control s equences described above 
may be joined together to produce a recombinant expression vector which may 
include one or more convenient restriction sites to allow for insertion or substitution 
of the nucleic acid sequence encoding the polypeptide at such sites. Alternatively, the 
nucleic acid sequence of the present invention may be expressed by inserting the 
nucleic acid sequence or a nucleic acid construct comprising the sequence into an 
appropriate vector for expression. In creating the expression vector, the coding 
sequence is located in the vector so that the coding sequence is operably linked with 
the appropriate control sequences for expression. 

[86] The recombinant expression vector may be any v^ector (e.g., a plasmid or 
virus), which can be conveniently subjected to recombinant DNA procedures and can 
bring about the expression of the polynucleotide sequence. The choice of the vector 
will typically depend on the compatibility of the vector with the host cell into which 
the vector is to be introduced. The vectors may be linear or closed circular plasmids. 

[87] The expression vector may be an autonomously replicating vector, i.e., a 
vector that, exists as an extrachromosomal entity, the replication of which is 
independent of chromosomal replication, e.g., a plasmid, an extrachromosomal 
element, a minichromosome, or an artificial chromosome. The vector may contain 
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any means for assuring self-replication. Alternatively, the vector may be one which, 
when introduced into the host cell, is integrated into the genorrxe and replicated 
together with the chromosome(s) into which it has been integrated- Furthermore, a 
single vector or plasmid or two or more vectors or plasmids which, together contain 
the total DNA to be introduced into the genome of the host cell, or a transposon may 
be used. 

[88] The expression vector of the present invention preferably contains one or more 
selectable markers, which permit easy selection of transformed cells. A selectable 
marker is a gene the product of which provides for biocide or viral resistance, 
resistance to heavy metals, prototrophy to auxotrophs, and the lilce. Examples of 
bacterial selectable markers are the dal genes from Bacillus siibtilis or Bacillus 
licheniformis, or markers, which confer antibiotic resistance such as ampicillin, 
kanamycin, chloramphenicol (Example 1) or tetracycline resistance. Suitable markers 
for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3. 

[89] Selectable markers for use in a filamentous fungal host cell include, but are 
not limited to, amdS (acetamidase), argB (ornithine carbamoyltransferase), bar 
(phosphinothricin acetyltransferase), hph (hygromycin phosphotransferase), niaD 
(nitrate reductase), pyrG (orotidine-5-phosphate decarboxylase), (sulfate 
adenyltransferase), and trpC (anthranilate synthase), as well as equivalents thereof. 
Preferred for use in an Aspergillus cell are the amdS and pyrG genes of Aspergillus 
nidulans or Aspergillus oryzae and the bar gene of Streptomyces hygroscopicus. 

[90] The vectors of the present invention preferably contain axi element(s) that 
permits integration of the vector into the host cell's genome or autonomous replication 
of the vector in the cell independent of the genome. For integration into the host cell 
genome, the vector may rely on the nucleic acid sequence encoding the polypeptide or 
any other element of the vector for integration of the vector into the genome by 
homologous or nonhomologous recombination. 

[91] Alternatively, the vector may contain additional nucleic acid sequences for 
directing integration by homologous recombination into the genome of the host cell. 
The additional nucleic acid sequences enable the vector to be integrated into the host 
cell genome at a precise location(s) in the chromosome(s). To increase the likelihood 
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of integration at a precise location, the integrational elements should preferably 
contain a sufficient number of nucleic acids, such as 100 to 10,000 base pairs, 
preferably 400 to 10,000 base pairs, and most preferably 800 to 10,000 base pairs, 
which are highly homologous with the corresponding target sequence to enhance th_e 
probability of homologous recombination. The integrational elements may be airy 
sequence that is homologous with the target sequence in the genome of the host cell. 
Furthermore, the integrational elements may be non-encoding or encoding nucleic 
acid sequences. On the other hand, the vector may be integrated into the genome of 
the host cell by non-homologous recombination. 

[92] For autonomous replication, the vector may further comprise an origin of 
replication enabling the vector to replicate autonomously in the host cell in question. 
Examples of bacterial origins of replication are P15A, pSClOl, pMBl and ColEX. 
Origins of replication of plasmids pBR322 (which has a pMBl origin of replication) 
pUC19 (which has a ColEl origin of replication), pACYC177 and pACYC184 (whiclh 
have a P15A origin of replication), permit replication in E. coli\ origins of replication! 
for plasmids pUBl 10, pE194, pTA1060, or pAM.beta.l permit replication in Bacillus. 
Examples of origins of replication for use in a yeast host cell are the 2 micron origizn 
of replication, ARS1, ARS4, the combination of ARS1 and CEN3, and tke 
combination of ARS4 and CEN6. The origin of replication may be one having a 
mutation which makes its functioning temperature-sensitive in the host cell (see, e.g--, 
Ehrlich, 1978, Proceedings of the National Academy of Sciences USA 75: 1433). 

[93] More than one copy of a nucleic acid sequence of the present invention ma^y 
be inserted into the host cell to increase production of the gene product. An increas e 
in the copy number of the nucleic acid sequence can be obtained by integrating at 
least one additional copy of the sequence into the host cell genome or by including am 
amplifiable selectable marker gene with the nucleic acid sequence where celLs 
containing amplified copies of the selectable marker gene, and thereby additional 
copies of the nucleic acid sequence, can be selected for by cultivating the cells in the 
presence of the appropriate selectable agent. 

[94] The procedures used to ligate the elements described above to construct tlxe 
recombinant nucleic acid construct and expression vectors of the present invention are 
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well known to one skilled in the art (see, e.g., J. Sambrook, E. F. Fritsch, and T. 
Maniatis, 1989, Molecular Cloning, A Laboratory Manual, 2d edition, Cold Spring 
Harbor, N.Y.). 

[95] Many of the expression vectors for use in the present invention are 
commercially available. Suitable commercial expression vectors include 
p3xFLAGTM™ expression vectors from Sigma- Aldrich Chemicals, St. Louis MO., 
which includes a CMV promoter and hGH polyadenylatioxi site for expression in 
mammalian host cells and a pBR322 origin of replication and ampicillin resistance 
markers for amplification in E. coli. Other suitable expression vectors are 
pBluescriptll SK(-) and pBK-CMV, which are commercially available from 
Stratagene, LaJolla CA, and plasmids that are derived from. pBR322 (Gibco BRL), 
pUC (Gibco BRL), pREP4, pCEP4 (Invitrogene) or pPoly (Lathe et al., 1987, Gene 
57, 193-201). 

[96] Example 6 herein discloses the use of the expression vector pCKl 10900-1 Bla, 
as shown in the vector map of FIG. 3. 

Host Cells 

[97] Host cells for use in expressing the expression vectors of the present invention 
include but are not limited to, bacterial cells, such as E. coli, Streptomyces and 
Salmonella typhimurium cells; fungal cells, such as yeast cells (e.g., Saccharomyces 
cerevisiae or Pichia pastoris (ATCC Accession No. 201178)); insect cells such as 
Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, 293, and 
Bowes melanoma cells; and plant cells. Appropriate culture mediums and conditions 
for the above-described host cells are well known in the art. 

[98] By way of example, Escherichia coli W3110 was transformed by an 
expression vector for expressing the shuffled genes of the present invention. The 
expression vector was created by operatively linking a variant gene of the present 
invention to the lac promoter under control of the lad repressor gene. The expression 
vector also contained the P15A origin of replication atzd the chloroamphenicol 
resistance gene. The transformed Escherichia coli W3110 was cultured under 
appropriate culture medium containing chloramphenicol sucti that only transformed E 
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coli cells that expressed the expression vector survived. See e?.g., Example 1. 
Purification 

[99] Once the AAM polypeptides were expressed by the variant genes in E. coli, 
the polypeptides were purified from the cells and or the culture medium using any one 
or more of the well known techniques for protein purification, including lysozyme 
treatment, sonication, filtration, salting, ultra-centrifugation, affinity chromatography, 
and the like under strict anoxic conditions. Suitable solutions for high efficiency 
extraction of proteins from bacteria, such as E. coli, are commercially available under 
the trade name CelLytic B™ from Sigma-Aldrich of St. Louis MO. A suitable 
process for purifying AAM polypeptides sufficiently from cell lysate for applications 
in a chemical process is disclosed in the references: Chirpich, T. P. et al., J. Biol. 
Chem., 1970, 245, 1778-1789; and Petrovich, R. M. et al, J. Biol. Chem., 1991, 266, 
7656-7660, both of which are incorporated herein by reference. 

Screening 

[100] After several rounds of directed evolution were performed, the resulting 
libraries of exemplary AAM polypeptides were screened. Screening for transformed 
cells that express a polypeptide having AAM activity is, in general, a two-step 
process. First, one physically separates the cells and then determines which cells do 
and do not possess a desired property. Selection is a form of screening in which 
identification and physical separation are achieved simultaneously by expression of a 
selection marker, which, in some genetic circumstances, allows cells expressing the 
marker to survive while other cells die (or vice versa). Exemplary screening markers 
include luciferase, (3-galactosidase, and green fluorescent protein. Selection markers 
include drug and toxin resistance genes, such as resistance to chloramphenicol, 
ampicillin and the like. Although spontaneous selection can and does occur in the 
course of natural evolution, in the present methods selection is performed by man. 

[101] The AAM polynucleotides generated by the mutagenesis or directed evolution 
method are screened in accordance with the protocol described in Example 8 to 
identify those having enhanced activity that are suitable for inclusion as an improved 
AAM polypeptide of the present invention. In the process of Example 8, the 
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screening of clones from the expression libraries for enhanced AAM activity was 
performed by measuring the conversion of a-alanine to p-alanine using liquid 
chromatography and mass spectrometry. Based upon the screening results, the AAM 
polypeptides of the present invention are listed in Table 2 below along with their 
residue changes and enhanced AAM activity relative to one parental AAM 
polypeptide, i.e., the polypeptide of SEQ ID NO: 59. 



Table 2 



Seq. ID No. 


Residue changes relative to 
parent SEQ ID NO: 59 


Rate of 
P-alanine(uM) 
produced /hr 
1 Cell OD 


34 


I177L, I227M, G308R, I408L, 


31.9 


10 


I298V, G308R, F416S, D447G 


6.3 


no 
OO 


L)1Z3JN, 11 / 11—,, IZIUO, 


I I .VJ 


20 


K2E, I307L, 


14.7 


14 


K13E, L17R, LI 97P, I200T, 
M281V, F310S, F416S, D447G 


7.7 


22 


Y72H, L118P, R145L, I220V, 
F240L, S250P, R311C, F416S, 
D447G 


1.0 


42 


K19R, T99S, G308R, F416S, 
D447G 


O.O 


26 


N80K, G308R, E3 19G, R325G, 
Q350R 


4.8 


18 


Q32R, S74P, SI 13T, LI 18P, 
G308R,F416S,D447G 


3.9 


44 


D79E, G308R, S329P, F393S, 
F414S, D445G, L453S, 


12.9 


51 (fragment) 


A73V, G308R,Y33lN, F416S, 
D447G 


7.0 


36 


D79E, S93P, N132D, M281I, 
G308R,Y33lN, F416S, D447G 


6.0 


48 


K2E, M76I, D79E, T131A, 
L203P, G308R, Y331C, F416S, 
D447G 


22.0 


12 


R38G, C134G, C141R, L203P, 
I280T, G308R, F416S, D447G 


3.6 


4 


2KE, I220V, N237D, G308R, 
D360G, K361R, F416S, D447G 


4.5 
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16 


K13E, L17R, L197P, I200T, 
M281V, G308R, F310S, F416S, 
D447G 


19.4 


24 


E23D, L43S, D124G, Y137H, 
K156E, G308R, D411G, F416S, 
D447G 


18.9 


46 


W18R, M76I, D79E, V90A, 
M152T, I163T, S178P, V215G, 
G308R, V354A, F416S, D447G 


20.7 


28 


E22G, Y71C, S74P, H108R, 
D187G, I244V, G308R, E396G, 
F416S, D447G,F454S 


29.2 


40 


Y137H, G308R, D41 1G, F416S, 
D422V, D447G 


2.9 


32 


H35R, D79E, K98T, T99S, 
N132S, S135P, E204G, K230R, 
G308R,F416S,D447G 


13.6 


2 


W235R, S250P, C254R, D276G, 
G308R, Y380C, I381T, F416S, 
K440E, D447G 


17.5 


30 


Q32R, N67S, H140R G308R, 
F416S,D447G 


14.3 


6 


E24G, M96I, E109G, G308R, 
F416S,D447G 


23.0 


8 


G308R, S329P, F416S, D447G, 
L455S 


14.7 



[102] In Table 2 above, it is seen that the AAM polypeptides of the present 
invention have from 2 to 1 1 residue differences than tlxeir parent polypeptide of SEQ 
ID NO: 59, and very significant AAM activity as evidenced by the production of P~ 
alanine in the assay of Example 8. In comparison, (3 -alanine was not detected for 
SEQ ID NO: 59 under the assay conditions used to test the AAM variants. However, 
some P-alanine production for parental SEQ ID NO: 59 was detected in a qualitative 
growth based complementation assay. 

[103] Referring to Table 2 above, two preferred residue changes for the AMM 
polypeptides of the present invention relative to the parental sequence of SEQ ID NO: 
59 are G308R and F416S. In those AAM polypeptides of the present invention that 
are at least 447 residues long, an additional preferred residue change is D447G 
relative to the parental sequence of SEQ ID NO: 59 . Additional suitable residue 
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changes are G308K, F416M and D447L, A, I or V. Thus, in one aspect, the present 
invention is directed to an AAM polypeptide having at least 5 amino acid residue 
changes, typically 5-11 residue changes, relative to SEQ ID NO: 59 or a truncated 
fragment thereof as taught herein, the residue changes including from 1 to 3 residue 
changes selected from the group consisting of G308IL, G308K, F416S, F416M, 
D447G, D447L, D447A, D447I and D447V. 

[104] Based upon the AAM activity in Table 2, an especially preferred AAM 
polypeptide of the present invention is a polypeptide having 95% sequence homology 
with the polypeptide of SEQ ID NO: 34, more preferably 98% homology, most 
preferably 99% homology. 

[105] The parental polypeptides of SEQ ID NOs: 53, 55 and 57 demonstrate that the 
residues 1-8 at the N-terminus and residues 434-473 at the C-terminus are not 
necessary for KAM or AAM activity. Likewise, the polypeptide fragment of SEQ ID 
NO: 51, which is a 399 residue expression product, discloses that the first 72 amino 
acids at the N-terminus relative to the parental clone of SEQ ID NO: 59 are not 
necessary for AAM activity. (See Table 2) Thus, it is also within the scope of the 
present invention that the polypeptides described herein include fragments thereof that 
lack from 1 to 72 residues from their N-terminus relative to the parental sequence of 
SEQ ID NO: 59, typically from 1 to 40 residues, more typically from 1-20 residues, 
most typically from 1 to 1 1 residues. It is also within the scope of the present 
invention that the above described N-terminal truncation be utilized in combination 
with a C-terminal truncation as described elsewhere herein. 

[106] Only a very few (< 0.5%) of the mutations to the parental B. subtilis KAM 
(SEQ ID NO: 59) backbone were found to be beneficial. Specifically, for every 1000 
clones screened, there occurred only 3-5 single point or double point mutations that 
were beneficial. In fact, some of the mutations were found to be detrimental. 

[107] The first of the following two sets of sequences provides the sequence of the 
wild type B. subtilis lysine 2,3-aminomutase (KAM) polypeptides of the prior art, as 
deposited (GI_2529467_GB_AAB81159.1 J. This sequence (SEQ ID NO: 60) was 
not used as a parent sequence but is provided only for purposes of comparison. 
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MKNKWYKPKRHWKEIELWKDVPEEKWNDWLWQLTHT 

VRTLDDLKKVINLTEDEEEGVRISTKTIPLNITPYYASL 

MDPDNPRCPVRMQSVPLSEEMHKTKYDLEDPLHEDED 

SRVPGLTHRYPDRVLFLVTNQCSMYCRYCTRRRFSGQI 

GMGVPKKQLDAAIAYIRETPEIRDCLISGGDGLLINDQI 

LEYILKELRSIPHLEVIRIGTRAPVVFPQRITDHLCEILK 

KYHPVWLNTHFNTSIEMTEESVEACEKLVNAGVPVGN 

Q A VVL AGIND SVPIMKKLMHDLVKIRVRPYYIYQCDLS 

EGIGHFRAPVSKGLEIIEGLRGHTSGYAVPTFVVDAPGG 

GGKIALQPNYVLSQSPDKVILRNFEGVITSYPEPENYIP 

NQADAYFESVFPETADKKEPIGLSAI FADKEVSFTPENV 

DRIKRREAYIANPEHETLKDRRERRDQLKEKKFLAQQK 

KQKETECGGDS S 



[108] The second sequence in the set indicates the diversity of the AAM 
polypeptides of the present invention relative to the known wild-type B. subtilis KAM 
sequence by designating with the letter "X" followed by the residue number those 
residues in the Applicants' AAM polypeptides that differ from those of wild-type B. 
subtilis KAM sequence: 



M X 2 N K W Y K P K R H W X 13 E I E Xiv W Xi 9 D V P X 23 X 24 K W N D W L W 
X32 L T X35 T V X38 T L D D X43 KKVINLTEDEEEGVRISTKTIPL 
X 67 I T P X71 X 72 X 73 X 74 L M D P X79 X 80 P R C P V R M Q S V P L X 93 E E X 96 H 
X98 X99 KYDLEDPL Xi 08 X09 DEDS X n4 V P G X n8 T H R Y P Xi 24 R V L F 
L V T X t32 Q X134 X135 X136 X137 C R Xho Xui TRR X145 FSGQIGMGVP 
Xi 56 KQLD AAIAYIRETPEIRDCLIS GGDGLLINX 18 7QILEYI 
L K E X 197 R S X 200 P H X 203 X 204 V I R I G T R A P V V F P Q R I T D H X 224 C E I 
L K X 23 o X 23 1 H P V X 235 L X 237 T H X 240 NTSIEMTEE X 250 V E A X 254 E K L 
VNAGVPVGNQAVVLAGINX 27 6SVPX 280 X 2 8iKKLMHDLVKI 
RVRPYYIYQCDLSEGX307 X308 H X310 X311 A P V S K G L X319 1 1 E G L 
R G H T X 329 G X331 A V P T F V V X339 APGGGGKIAL X 350 P N Y V L S Q 
SPX 360 KVILRNFEGVITS YPEPENX 38 oX 3 8iPNQ AD A YFES V 
X393PX395TADKKEPIGLS AX408F AX411 KEVSX416TPENVX422RI 
KRRE AYIANPEHETLX440DRREX445RX447QLKEKKX454X455A 
QQKKQKETECGGDSS 



The diversity of changes at various residue positions for the AAM polypeptides of the 
present invention are shown to the right of the arrow in Table 2 below and relative 
amino acid residues of wild-type KAM of B. subtilis 
(GI_2529467_GB_AAB81159.1J (SEQ ID NO: 60) which are shown to the left of 
the arrow: 
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Table 3 



x 2 


K^E 




X13: 


K— ► E 




X17: 


L^R 




X19: 


K->R 




X23: 


E^ D, G 




X24' 


E^ G 




X32: 


Q-> R, 




X35: 


H— > R 




X38: 


R— ► G 




X43: 


S 




X67^ 


N-^ S 




X71: 


Y— ► C 




X72: 


Y-^ H, W 




X73 : 


V 




X74: 


S^P 




X79: 


D^E 




Xso: 


N->K 




X93: 


S-> P 




X 9 6: 


M— >I 




Xgg: 


K-^T 




X99: 


T— ► S 




X108" 


H~>R 




X109: 


E^ G 




X114: 


R-*P 




Xns: 


L^P 




Xi24- 


D— >N 





X132: 


N—> D, S 




X134: 


C— » G 




X135: 


S— > P 





Xi36: 


M— ► V 




X137: 


Y— > H 




X140: 


Y— > H 




X141 : 


C^R 




X145: 


R^L 




X156: 


K-*E 




X187: 


D~>G 




X197: 


L— *P 




X200- 


I->T 




X203: 


L-+P 




X 2 04- 


E-+G 




X224^ 


L-^P 




X230: 


K^R 




X231: 


Y->H 




X235' 


W-»R 




X237: 


N->D 
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X240 


F-+L 


X250 


S^P 


X254 


C-> Y, R 


X276 


D-»G 


X280 


I^T 


X281 


I 5 V 


X307 


I^L 


X308 


G->R 


X310 


F->S 


X311 


R^C 


X319 


E^G 


X329 


S^P 


X331 


Y^N 


X339 


D^H 


X350 


Q-R 


X360 


D^G 


X361 


K^R 


X380 


Y~>C 


X381 


I^T 


X393 


F^ S 


X395 


E^G 


X408 


I^L 


X411 


D^G 


X416 


F->S 


X422 


D-> V 


X440 


K^E 


X445 


R^K 


X447 


D^G 


X454 


F^S 


X455 


L^S 



[109] In a fourth aspect, the present invention is directed to a method of making an 
AAM a nucleic polypeptide of the present invention comprising (a) cultivating a host 
cell transformed with a nucleic acid sequence encoding an AAM polypeptide of the 
present invention under conditions suitable for production of the polypeptide; and (b) 
providing glucose to the cultivated host cells tinder conditions suitable for the 
production of P-alanine. The |3-alanine may be optionally recovered from the cells. 

Example 1: Transformation protocol for nam libraries/ ApanD strain 

[110] A mutant E. coli strain - ApanD, derived from BW251 13 which is described in 
Datsenko, K.A. and Wanner, B.L., Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000) 
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was used as the host strain for screening of the aarn gene libraries. The protocol used 
to make the deletion is detailed in Example 4 of Cargill patent application WO 
03/062173. 

[Ill] Chemical competent E. coli ApanD was removed from -80°C frozen storage 
and thawed. Thereafter, it was kept on ice until used. An aliquot (100|il per 
transformation) was transferred into a sterile 1.5ml centrifuge tube. A KCM (5X) salt 
solution was added until the concentration in the aliquot was IX. KCM consists of 
700 mM KC1; 10 mM morpholinopropanesulphonic acid (MOPS) adjusted to pH 5.8. 
l-5\xl of the ligation mixture was added to the cells. The cells containing the ligation 
mixture were first incubated on ice for 30 minutes. The cells were heat shocked at 
42°C for 1 min, and subsequently incubated on ice for 2 minutes. 500jul of SOC 
(Maniatis, T., Fritsch, E. F., and Sambrook, J. (1982) Molecular Cloning: A 
Laboratory Manual, 1st Ed., pp. A.2 and A.3, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY) was added to the cells, and the cells were incubated at 37°C for 1 
hour with agitation. The cells were then centrifuged at 5000 rpm for 3 minutes, and 
the SOC was removed. The cell pellet was re-suspended in 500^1 of M9 selection 
medium ((Maniatis, T., Fritsch, E. F., and Sambrook, J. (1982) Molecular Cloning: 
A Laboratory Manual, 1st Ed., pp. A.2 and A.3, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY) and incubated at 30°C for 2-4 hours with agitation. The cells 
were then plated onto M9 minimal agar mediunx supplemented with 1% mannose, 
20jllM iron citrate, 5.0 g/1 oc-alanine, O.lmM isopropyl-p-D-thiogalactoside (IPTG) 
(Sigma Chemical Corp., St. Louis, MO), 50mM[ MOPS, 25mM bicarbonate, and 
30|iig/ml chloramphenicol. The plated cells were incubated at 30°C for 3 days or until 
colonies were of sufficient size to be picked using the Q-BOT™ robot colony picker ( 
Genetix USA, hie, Boston MA). 

[112] In Round 2 of the transformation, the above procedure was followed except 
that the incubation temperature of the last two incubations in the procedure was 
increased to 37°C ? and M9 minimal selection medium was not supplemented with oc- 
alanine (0 g/L oc-alanine). 
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A. Alternate Transformation protocol for aam libraries/ ApanD KlfldA 
strain 

[113] A mutant E. coli strain tspanJD, derived from BW251 13 which is described in 
Datsenko, K.A. and Wanner, B.L., Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000) 
is used as the host strain for screening of the aam gene libraries. The protocol used to 
make the deletion is detailed in Example 4 of International patent publication WO 
03/062173. Optimally, a strain additionally having an increased expression of the 
flavodoxin (fldA) gene was used as the host strain for screening of the aam gene 
libraries, since increased flavodoxin enhances aminomutase activity when produced in 

E. coll See USSN , by Cargill, Inc. (Liao, et al), filed October 14, 

2005, entitled "Increasing the Activity of Radical S-Adenosyl Methionine (SAM) 
Enzymes" describes the production of p~alanine from cells that express AAM and 
overexpress flavodoxin at Examples 1-4, and these examples are incorporated herein 
by reference. This same application, USSN , by Cargill, Inc. (Liao, et 

al.) filed October 14, 2005, describes in Example 4 (incorporated herein) the 
construction of a strain of E, coli in which an artificial Pi ac /ara hybrid promoter was 
placed immediately upstream of the fldA gene. Strains carrying the artificial promoter 
before the fldA gene are designated KLifldA, where KI refers to "knock-in"). 

[114] Competent cells of E. coli AgtanD KlfldA are prepared either chemically or 
electrochemically using standard protocols. Competent E. coli ApanD KlfldA was 
removed from -80°C frozen storage and thawed. Thereafter, it was kept on ice until 
used. An aliquot (IOOjlxI per trans formation) was transferred into a sterile 1.5ml 
centrifuge tube. A KCM (5X) salt solution was added until the concentration in the 
aliquot was IX. KCM consists of 700 mM KC1; 10 raM 
morpholinopropanesulphonic acid (MOPS) adjusted to pH 5.8. l-5|al of the ligation 
mixture was added to the cells. The cells containing the ligation mixture were first 
incubated on ice for 30 minutes. Th.e cells were heat shocked at 42°C for 1 min, and 
subsequently incubated on ice for 2 minutes. 500^x1 of SOC (Maniatis, T., Fritsch, E. 

F. , and Sambrook, J. (1982) Molecular Cloning: A Laboratory Manual, 1st Ed., pp. 
A.2 and A.3, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY) was added to 
the cells, and the cells were incubated at 37°C for 1 hour with agitation. The cells 
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were then centrifuged at 5000 rpm for 3 miniates, and the SOC was removed. Pellets 
were subsequently resuspended in a medium appropriate for either the 
complementation assay (Example 3) or the biotransformation assay (Example 4). 

Example 2: Cloning of aam genes into pCKl 10900 series vectors 

[115] The strategy employed for cloning the alanine aminomutase genes into an 
inducible expression system involved the isolation of the aam gene by PCR and 
cloning of the PCR fragment into the Sfil restriction sites downstream from a mutant 
lac promoter/operator system. Initially, PCR primers were designed to contain a 
nucleotide sequence that is specific to the 5' and 3 ? ends of the aam gene, as well as 
the Shine-Delgarno sequence of the ribosome-binding site, and the unique Sfil 
restriction sites. The gene was then amplified from a template, purified and digested 
with the restriction endonuclease Sfil. The restricted PCR fragment was purified 
using the QIAquick PCR purification kit (Qiagen), and cloned into the Sfil sites of the 
expression vector pCKl 10900-1 Bla of FIG. 3 under the control of a lac promoter and 
lad repressor gene. The expression vector also contained the PI 5a origin of 
replication and the chloramphenicol resistance gene. Shuffled aam gene libraries 
were cloned by the same method. Several clones were found that expressed an active 
alanine 2,3-aminomutase (as per the method of Example 8) and the synthetic genes 
were sequenced. A polynucleotide sequence designated BSAAM (SEQ ID NO: 58) - 
was used as a starting material for all further mutations and shuffling. BSAAM (SEQ 
ID NO: 58) has approximately 99.2% nucleotide identity with the wild-type Bacillus 
subtilis lysine aminomutase (GenBank Accession No. H10329). 

Example 3: Screening via the Tier 2a growth assay 
Tier 2a growth Assay 

[116] The growth assay identifies variants capable of generating the essential 
metabolite AcetylCoA via (3-alanine produced by AAM variants in the E. coli hpanD 
host strain. Growth is therefore a function of Co A production, and indirectly of AAM 
activity. 
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A. Procedure 

[117] AAM active clones from the tier 1 complenxentation assay were picked with a 
QBOT™ robot colony picker (Genetix USA, Inc., Boston MA) and inoculated into a 
96-well master plate. The inoculums were grown in the 96 well master plate on a 
buffered minimal selection media (Na 2 HPO. 7H z O 12.8g/L; KH 2 P0 4 3g/L; NaCl 
0.5g/L; NH 4 C1 lg/L; MgSG 4 2mM; CaCl 2 0.04mM; mannose 2%; IPTG ImM; ferric 
citrate 20 uM; chloramphenicol 30 |~ig/ml; MOPS pH 7, 50mM; and sodium 
bicarbonate pH 9, 25mM) (hereinafter "MSM") to Avhich was added O.luM p-alanine 
and 0.5g/L a-alanine. Plates were covered using AirPore™ microporous tape 
(Qiagen, Inc.) and incubated at 25°C, 250 rpm, 85 G /o humidity until cultures reached 
saturation, at which time glycerol was added to the cultures to a final concentration of 
20-30%, and the plates stored at -80°C. 

[118] Samples from a frozen master plate were arrayed into an "inoculum" plate 
containing buffered minimal selection media (MSM), as described above, further 
containing 0.5g/L a-alanine. The inoculum plates were covered with AirPore™ 
microporous tape (Qiagen, Inc.) and incubated at 25°C, 250 rpm, 85% humidity until 
cultures reached saturation. 

[119] ISjliI from the inoculum plate was inoculated into a 96-well "assay" plate 
containing 185p,l of fresh MSM with 0.5g/L a-alanine. The assay plates were 
covered with AirPore™ microporous tape (Qiageix, Inc.) and a lid, and incubated at 
25°C, 85% humidity, 250rpm. Measurements of OD at 600nm were made at discrete 
times for a period of approximately (~) 40hours. 

B. Data Analysis 

[120] Since library variants exhibit unique growth profiles, it was preferable to 
calculate and compare growth rates (slopes) at three (3) different growth phases 
(early, mid and late) to identify all potentially improved variants. Clones that exhibit 
three (3) standard deviations above the plate average in any of the three (3) phases 
were designated as potentially improved variants and were retested in tier 2b for 
comparative ranking. 
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Example 4: Screening via ttie Tier 2b growth assay 

[121] The stringency of the growth screen is increased in Tier 2b by excluding oc- 
alanine (the substrate for AAMQ from the medium. Under these conditions, the cell 
relies on internal/cellular pools of a-alanine to serve as a substrate for AAM, and 
subsequently, for cell growth. AAM variants capable of utilizing low, intracellular 
pools of a-alanine might represent low Km variants. 

A. Procedure 

[122] Samples from a frozen master plate were arrayed into an "inoculum" plate 
containing buffered minimal selection media (MSM), as described above, further 
containing 0.5g/L a-alanine. The inoculum plates were covered using AirPore™ 
microporous tape and incubated at 25°C, 250 rpm, 85% humidity until cultures 
reached growth saturation. 

[123] A TECAN™ Robotic Sample Processor (Columbus, Ohio) was used to 
remove 10|nl of inoculum from each Tier 2a variant from the inoculum plates and seed 
it in replicates of 8 into each of the following: 

96-well Assay plate containing 1 90jli1 of fresh MSM, 0.5g/L a-alanine. 

96-well Assay plate containing 1 90jal of fresh MSM, containing no a-alanine. 

The Assay plates were covered with AirPore™ microporous tape and a lid and grown 

at 25°C, 85% humidity, 250rpm. Samples were collected at time points for 

approximately 3-4 days and the OD 6 oonm was measured for each sample. 

B. Tier 2b Data Analysis 

[124] Variants were ranked by the following 3 criteria: 

i) Growth ratio equal to a final culture OD600 on medium without a-alanine/final 
culture OD 6 oonm on medium containing a-alanine; 

ii) Final culture ODeoo; and 

iii) Initial growth rates (in pliase 1, from approximately 0-20 hour) 
Clones with final culture OD 6 oanm > 0.7 were retained. 
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Clones were then ranked based on the growth ratio of criteria (i). Any clones with a 
OD 60 onm > 0.7 were retained. However, clones that did not meet the above two 
criteria, but had a very good initial growth rate (iii) were also selected for further 
evaluation. 

Example 5: Screening via Tier 2c- PCR analysis 

The PCR screen identifies variants that contain the correct size gene in the expression 
vector prior to further screening for function. It excludes unstable gene variants that 
may have undergone deletions/truncations during the screening process. 

A. Procedure 

Potentially improved variants from frozen master plates were inoculated into a 96- 
microwell plate containing LB media with 1% glucose and 3<3|ug/mL 
chloramphenicol. Cultures were grown at 25°C, 250 rpm, 85% humidity ion plates 
covered with AirPore™ microporous tape (Qiagen, Inc.) until cultures reached 
saturation, approximately 2 days. IOjliL of the culture was transferred to a PCR plate 
and boiled at 99°C for 10 minutes to disrupt the cells. Thereafter, 90 jiL of the 
following PCR Master Mix was added to the disrupted cells: 

PCR Master Mix: 



4 \xL 



10 \xL 



10X Taq Polymerase Buffer (QIAGEN, Valencia C^V) 
25 mM MgCl 2 



2 jllL 



lOmMdNTPs 



1.25 |liL 
1.25 jliL 
1 iiL 



20 )llM primer - B forward (specific for BsAAM gene) 
20 j-iM primer - B reV erse (specific for BsAAM gene) 
5U/pL Taq polymerase (QIAGEN) 



70.5 jllL 



Sterile water 



90 jiL 



Total volume 



The Bacillus specific primers used in the PCR reaction are as follows: 



WO 2006/047589 



PCT/US2005/038552 



-44- 



B-forward: 

5'ccagcctggccataaggagatatacatatgaaaaacaaatggtataaac 3 1 SEQ ED NO: 63 
B -reverse: 

5' atggtgatggtgatggtggccagtttggccttatgaagaatcccctccgc 3' SEQ ID NO: 64 



The amplification reaction was run for 30 cycles. The first cycle was run at 94° C for 
1 minute. Thereafter, the extension procedure was performed for 29 cycles: 9-4. 0°C 
for 1 minute; 55.0°C for 30 seconds; and 72.0°C for 1 minute. The final extension 
was performed at 72.0°C for 5 minutes. The products of the PCR reactions were 
analyzed by gel-electrophoresis on a 0.8% agarose gel. 

Example 6: Growth of AAM variants for p-alanine production (50 ml scale). 
Cell selection method for identifying AAM activity, 

[125] To identify genes encoding polypeptides that can perform the alanine 2,3- 
aminomutase reaction, an efficient screen or selection for the desired activity is 
needed. Therefore, a selection method was developed by recognizing that E. coli uses 
beta-alanine for the synthesis of pantothenic acid, which in turn is a component of 
coenzyme A (CoA) and of acyl carrier protein (ACP). CoA and ACP are the 
predominant acyl group carriers in living organisms, and are essential for growth. 

[126] In E. coli, the primary route to beta-alanine is from aspartate in a reaction 
catalyzed by aspartate decarboxylase (E.C. 4. 1. 1.1 1), which is encoded by the panD 
gene. A functional deletion mutation of panD (shown as ApanD) results in beta- 
alanine auxotrophy and growth inhibition, which can be alleviated by the exogenous 
addition of pantothenate or beta-alanine, or by the production of beta-alanine from 
another source. 

[127] Strain description: E. coli ApanD host (derived from BW25113, described in 
Datsenko, K.A. and Wanner, B.L., Proc. Natl. Acad. Sci. USA 97:6640-6645 (2O00)), 
transformed with pCKl 10900-1 Bla vector (low promoter strength resulting from 
mutated lac promoter sequence). The inoculum culture was grown in buffered 
minimal selection medium (MSM): M9 salts, pH 7.0-7.4, 50mM MOPs, pH 7.0, 25 
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mM sodium bicarbonate, pH 9.0, ImM isopropyl-f3-D-thiogalactoside (IPTG), 
30|ug/ml chloramphenicol, 0.1 g/L alanine, 5uM pyridoxine HC1, and 20uM ferric 
citrate. A 1:20 dilution of inoculum was used to inoculate 50ml of MSM mediumi 
described above. Cultures were incubated at 25°C, 250 rpm for approximately 3 da.ys 
or until the culture reaches OD 6 oonm ~1- Then, a-alanine was added to the medium to 
a final concentration of 300 mM, and pantothenate was added to about 300u3M. 
Incubation of the supplemented medium continued at 25°C, 250 rpm. Samples were 
removed from the medium for analysis at time points from t= 0 through t=5 hoiars 
following the addition of a-alanine. 

Example 7: Method for extracting cells for p-alanine detection 

[128] Cells from the cultures of Example 6 were harvested by centrifugation of the 
cultures. The supernatant (spent media) was decanted and saved for further analysis 
(below). The cell pellets were washed with water. Pellets may be stored at -80°C for 
future extraction. The 50ml cell pellets (OD - 4.0) were re-suspended completely in 
a test tube in 0.9 ml water. The extraction volume for each sample was adjusted to 
this proportion according to the harvest OD 6 oo- An equal volume of methanol (-20°C) 
and 200 jlxL of micro-glass beads was added and the mixture vortexed vigorously. 
Tubes containing the mixtures were placed on dry ice/EtOH, or in a -80°C freezer, for 
about 30 min. The frozen contents in the tube were thawed at room temperature a.nd 
vortexed vigorously again, and centrifuged at maximum speed for about 10 minutes. 
The supernatants were filtered using 0.2-0.45 micron filter plates, or syringe filters. 

[129] The spent medium was filtered using a 0.2-0.45 micron filter plate or syringe 
filter. The filtered spent medium was diluted 1:10 in -20°C methanol/water (final 
methanol concentration 50%). 

[130] The p-alanine content of cell extract and spent media was analyzed by 
LC/MS/MS (Example 8). 

For spent medium sample, the first minute was diverted to waste. The (3-alanine peak 
arrived at approximately 2.0 minutes. 
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The assay can be scaled to 2ml, if only the spent media is analyzed. 



Example 8: Assay for p-alanine (LC/MS/MS) 

[131] P-alanine was determined using a combination of liquid chromatography and 
mass spectrometry. Suitable analytes were the cell extracts and spent media as 
prepared in Example 7. 

[132] The liquid chromatography (LC) phase was performed using an ASTEC 
CHIROBIOTIC™ T 4.6 cm x 50 mm chiral LC column (Advanced Separation 
Technologies, Inc., Wliippany, N.J. USA). The mobile phase consisted of two 
solutions: A: 0.25% aqueous acetic acid; and B: 0.25% (v/v) acetic acid in methanol. 
The elution was isocratic @ 0.6 ml/minute. 

[133] The mass spectrometer (MS) analysis was performed on a Micromass Ultima 
Triple Quad mass spectrometer, using the following tune parameters: 
Capillary: 3.5 kV; cone: 20 V; hex 1: 15 V; aperture: 1.0V; source temp: 100°C; 
desolvation temp: 350°C; cone gas: 40 L/hr; desolvation gas: 500 L/h; low mass 
resolution(Ql): 12; higli mass resolution (Ql): 12; ion energy (Ql): 0.1; collision cell 
entrance: -5; collision energy: 14; exit: 1; low mass resolution (Q2): 15 high mass 
resolution (Q2): 15; ion energy (Q2): 3.0; multiplier: 650 V. 



MS Method 
Alanine transitions 

Analyte Parent Ion (m/z) Daughter Ion (m/z) Dwell Time (sec) 

a-alanine 90 44.7 0.1 

p-alanine 90 30.7 0.1 

a-lysine 147 84.5 0.1 

P-lysine 147 70.5 0.1 
The inter-channel delay was 0.1 seconds. 
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CLAIMS 

WHAT IS CLAIMED IS: 

1. A polypeptide having alanine 2,3-aminomutase activity (hereinafter an 
"AAM polypeptide") and 

(a) having an amino acid sequence selected from the group consisting of SEQ ID 
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 
48 and 51: 

(b) having an amino acid sequence which has at least 98% homology with the amino 
acid sequence selected from the group consisting of SEQ ID KTO: 2, 22, 28, 32, and 
36; 

(c) having an amino acid sequence which has at least 99% homology with the amino 
acid sequence selected from the group consisting of SEQ ID NO: 4, 6, 8, 12, 16, 24, 
26,30, 34 and 40; 

(d) being a polypeptide encoded by a nucleic acid sequence wliich hybridizes under 
high stringency conditions with either (i) the nucleotide sequeixce of SEQ ID NO: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21,23, 25, 27, 29,31,33,35, 37, 41,43,45, 47 or 49; 
(ii) a subsequence of (i) of at least 100 nucleotides, or (iii) a complementary strand of 
(i) or (ii); or 

(e) being a variant of the polypeptide of (d) comprising a substitution, deletion, and/or 
insertion of one to six amino acids therefrom and having AAM activity from about 1 
to about 30 \jM P-alanine produced /hour 1 cell OD at pH 7.0-7 .6, 25°C. 

2. The polypeptide of claim 1 having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 and 51. 

3. The polypeptide of claim 1 having an amino aci<l sequence which has 
at least 98% homology with the amino acid sequence selected from the group 
consisting of SEQ ID NO: 2, 22, 28, 32, and 36. 
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4. The polypeptide of claim 1 having an amino acid sequence which has 
at least 99% homology with the amino acid sequence selected from the group 
consisting of SEQ ID NO: 4, 6, 8, 12, 16, 24, 26, 30, 34 and 40. 

5. The polypeptide of claim 1 being a polypeptide encoded by a nucleic 
acid sequence which hybridizes under high stringency conditions with either (i) the 
nucleotide sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21 , 23, 25, 27, 29, 
31, 33, 35, 37, 41, 43, 45, 47 or 49; (ii) a subsequence of (i) of at least 100 
nucleotides, or (iii) a complementary strand of (i) or (ii) 

6. The polypeptide of claim 1 being a variant of the polypeptide of (d) 
comprising a substitution, deletion, and/or insertion of one to six amino acids 
therefrom and having AAM activity from about 1 to about 30 jllM P-alanine produced 
/hour 1/cell OD atpH 7.0-7.6, 25°C. 

7. An AAM polypeptide having an amino acid sequence of SEQ ID NO: 
2, 6, 12, 16, 20, 24, 28, 30, 32, 34, 38, 44, 46 or 48. 

8. The AAM polypeptide of claim 7 having an amino acid sequence of 
SEQ ID NO: 6, 12, 28, 34, 46 or 48. 

9. The AAM polypeptide of claim 8 having an amino acid sequence of 
SEQ ID NO: 28 or 34. 

10. A polynucleotide encoding an AAM polypeptide of claim 1 . 

11. A polynucleotide encoding a polypeptide having AAIM activity, said 
polynucleotide having SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21 , 23, 25, 27, 29, 
31, 33, 35, 37, 41, 43, 45, 47 or 49. 

12. An isolated and purified polynucleotide which encodes a polypeptide 
of claim 1 . 

13. An expression vector comprising a polynucleotide of claim 10 or 11 
operatively linked to a promoter. 
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14. A host cell transformed to express a polynucleotide of claim 1 0. 

15. A method of making an AAM polypeptide of claim 1, comprising (a) 
cultivating a host cell comprising a nucleic acid construct comprising a nucleic acid 
sequence encoding the AAM polypeptide under conditions suitable for production of 
the polypeptide; and (b) recovering the AAM polypeptide. 

16. An AAM polypeptide of claim 1 in lyophilized form. 

17. A composition comprising a polypeptide of claim 1 in a buffered 
medium. 

18. An AAM polypeptide having from 5 to 1 1 amino acid residue changes 
relative to SEQ ID NO: 59 or a fragment thereof, the residue changes including from 
1 to 3 residue changes selected from the group consisting of G308R, G308K, F4-16S, 
F416M, D447G, D447L, D447A, D447I and D447V. 
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SEQIDNO: 

1 

50 

P_GI2529467_G8_AAB81159 . 1_ 60 (1) 

MKNKWYKPKRHWKE I ELWKDVPEEKpiSMLflQLTHTVRTLDIDL KKVINgT 
P_GI2 63 43 6 1_EMB_CAB 1 3 8 6 0 . 1_ 61 ( 1 ) 
MKNKWYKPKRHWKE I EL WKI)^ 

P_S0 0 701550 59 (1) 
MKNKWYKPKRHWKEIELWKDpPEEK^NplLiQ 

P_S00701551 53 (1) 

MS LKDKFlTH^S QED^lSTOK^Q^RNRllTi^ELKiY I P§T 

P_S 0 0 7 0 15 52 55 (1) 

MAE s|RKYYgPD^TI3EQf Yi^IflQ|LNR||iTLDQLKaa^;Tj|T 

P_S0 1032894 57 (1) 

MNTWTRKKFSPN?T;D^ 

Consensus 62 (1) 
MKNKWYKPKRHWKE I ELWKDVPEEKWNDWLWQLTHTVRT LDDLKKVINLT 



FIG. 4A 



51 

100 

P_GI2529467_G8_AAB81159 . 1_ (51) 

EDEEg&VjRI STKTIPLNI ^YYASSm5pDNPRC£|V^MQS VpLS Eg§MHKTK 
P_GI 26 3 4 3 6 1_EMB_CAB 1 .3 8 6 0 . 1_ (51) 

EDE EpGV'R I S TKT I PLNjl^PYYASLjMitPDNPRC prV|SMy S vSl S E||MHKTK 

P_s50 7 01556^ (51) 
ED^EgplRI STKT I PLN^ 

P_S0 07 01551* (41) 

p|§iE^ 

"p^s 00701552 (43) 

ApSEEG\§fESPKV^ 

P_S01032894 (44 )^ 

E(EEfrp(^ 

Consensus (51) 
EDEEEGVRISTKTIPLNITPYYASLMDPDNPRCPVRMQSVPXjSEEMHKTK 



FIG. 4B 
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SEQUENCE LISTING 

<110> Craatterjee, Ran j ini 
Crien, Michelle 
Louie , Susan 
Mitchell, Ken 
Fox, Richard 

<120> Improved Alanine 2 , 3 -Ami norma t as es and Related Polynucleotides 
<130> 0359 .210WO/15686WO02 
<160> 64 

<170> Patentln version 3.3 

<210> 1 

<211> 1416 
<212> 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 



<400> 1. 
atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagct 


atggaaggac 


60 


gttccggaj.ag 


agaaatggaa 


cgattggctt 


tgacagctga 


cgcacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgca 


caaaacaaaa 


300 


tacgatafcgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


540 


gatgggc tgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


taccatccgg 


tccggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaaccc 


gttgaggcac 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatggctc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaacjggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 
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ccaggcggag ggggtaaaat cgccctgcag ccgaactatg tcctgtctca aagtcccgac 1080 

aaagtgatct taagaaattt tgaaggtgtg attacgtcat atccggaacc agagaattgt 1140 

acccccaatc aggcagacgc ctattttgag tccgttttcc ctgaaaccgc tgacaaaaag 1200 

gagccgatcg ggctgagtgc catttttgct gacaaagaag tttcgtctac acccgaaaat 1260 

gtagacagaa tcaaacggcg tgaggcatac atcgcaaatc cggagcatga aacattagaa 132 0 

gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 13 80 

cagaaagaga ctgaatgcgg aggggattct tcataa 1416 

<210> 2 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 2 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Tarp Lys Glu lie Glu 
15 10 15 

Leu Trp- Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
2 0 25 3 0 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu L^s Lys Val lie Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Sor Thr Lys Thr lie 
50 55 SO 

Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Hot Asp Pro Asp Asn 
65 70 75 ^ 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 

His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu Hlis Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 
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Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



lie Ser Gly Gly Asp Gly Leu Leu lie Asn Asp Gin lie Leu Glu Tyr 
180 ' 185 190 



lie Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg lie 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Arg Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Pro Val Glu Ala Arg Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Gly Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 3O0 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



WO 2006/047589 



PCT/US2005/038552 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Cys Thr Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 

385 " " 390 395 400 

Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 

405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Glu Asp Arg Arg Glu Lys Arg Gly Gin 

435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 3 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 3 

atggaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 
gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 120 
gatgatttaa agaaagtcat taatctgacc gaggatgaag aggjaaggcgt ccgtatttct 180 
accaaaacga tccccttaaa tattacacct tactatgctt ctttaatgga ccccgacaat 
ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aagaaatgca caaaacaaaa 
tacgatatgg aagacccgct tcatgaggat gaagattcac cgcjtgcccgg tctgacacac 3 60 
cgctatcccg accgtgtgct gtttcttgtc acgaatcagt gttccgtgta ctgccgccac 42 0 
tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 



240 
300 



480 



gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 



gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 



600 
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ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcgtt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgga 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gt tgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


acraatgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


at cagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctggc 


1080 


agagtgatct 


taagaaattt 


tgaaggtgtg 


at tacgtcat 


acccggaacc 


agagaat tat 


1 1 a n 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


at cgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 4 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 4 

Met Glu Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
1 " 5 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 2 5 3 0 

Leu Thr His Thr Val Arg Thr Leu Asrp Asp Leu Lys Lys Val lie Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala. Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 
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Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 

His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leia His Glu Asp Glu Asp 
100 105 110 

Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 

Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 

Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 

Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 

He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 

He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 

Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg Val Thr Asp His Leu 
210 ~ 215 220 

Cys Glu He Leu Lys Lys Tyr His Pro Val Tirp Leu Asp Thr His Phe 
225 230 235 240 

Asn Thr Ser He Glu Met Thr Glu Glu Ser Va.1 Glu Ala Cys Glu Lys 
245 250 255 

Leu Val Asn Ala Gly Val Pro Val Gly Asn GILn Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 
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Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 "* 310 315 320 



He Glu Gly Leu Arg Gly His Thr Sear Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Gly Arg- Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Ph_e Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He L>^s Arg Arg Glu Ala Tyr He Ala 
420 42 5 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala G3_n Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 5 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 5 

atgaaaaaca aatggtataa accgaaacgg czattggaagg agatcgagtt atggaaggac 60 
gttccggaag ggaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 12 0 
gatgatttaa agaaagtcat taatctgacc cjaggatgaag aggaaggcgt ccgtatttct 180 
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accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ctttaatgga 


ccccgacaat 


24 0 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaataca 


caaaacaaaa 


30 0 


tacgatatgg 


aagacccgct 


tcatggggat 


gaagactcac 


cggtacccgg 


tctgacacac 


36 0 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttctgtgta 


ctgccgccac 


42 0 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


48 0 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


54 0 


cratagcrctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


60 0 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgcccccg 


tcgtctttcc 


gcagcgcatt 


6S 0 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


72 0 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


7B 0 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


84 0 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


90 0 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


950 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


102 0 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


10S 0 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaat tat 




atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


120 0 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1250 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


132 0 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


13S 0 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






143_6 



<210> 6 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 6 



Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
1 5 10 15 
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Leu Trp Lys Asp Val Pro Glu Gly Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu He 
85 90 95 

His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Gly Asp Glu Asp 
100 105 no 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 

Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 " 175 

He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 

He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 2 40 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 ~ 255 
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Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr Tyr lie Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly lie Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu lie 
305 310 315 320 



lie Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val lie Leu Arg Asn Phe Glu 
355 360 365 



Gly Val lie Thr Ser Tyr Pro Glu Pro Glu Asn Tyr lie Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro lie Gly Leu Ser Ala lie Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg lie Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



WO 2006/047589 



PCT/US2005/038552 



-11- 

<210> 7 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 7 

atgaaaaaca aatggtaicaa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 12 0 

gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tattacacca tactatgcga gcttaatgga tccagaaaac 240 

ccacgttgtc cggtacgcat gcagtctgtg ccgctttccg aagaaatgca caaaacaaaa 3 00 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 3 60 

cgctatcccg accgtgt^ct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 42 0 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 

accgatcatc cgtgcgag-at attgaaaaaa tatcatccgg tctggctgaa cacccatttt 72 0 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 7 80 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc ggttccaatt 840 

atgaaaaagc tcatgcatga cttggtaaaa atcagagtcc gtccttatta tatttaccaa 9 00 

tgtgatctgt cagaagg^iat aaggcatttc cgtgctcctg tctccaaagg tttggagatc 9 60 

attgaagggc tgagagg-fcca taccccaggc tatgcggttc ctacctttgt cgttcacgca 102 0 

ccaggcggag gaggtaaeiat cgccctgcag ccgaactatg tcctgtctca aagtcctgac 1080 

aaagtgatct taagaaaiztt tgaaggtgtg attacgtcat atccggaacc agagaattat 1140 

atccccaatc aggcagacgc ctattttgag tccgtttccc ctgaaaccgc tgacaaaaag 12 00 

gagccgatcg ggctgagizgc catttttgct gacaaagaag tttcgtctac acctgaaaat 12 60 

gtagacagaa tcaaacggcg tgaggcctac atcgcaaatc cggagcatga aacattaaaa 132 0 

gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat tttcggcgca gcagaaaaaa 1380 

cagaaagaga ctgaatgcgg aggggattct tcataa 1416 
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<210> 8 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 8 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
1 5 10 15 

Leu Trp Lys Asp Val Pro GItjl Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thar Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu. Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyjr Tyr Ala Ser Leu Met Asp Pro Glu Asn 
65 70 75 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 

His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
10 0 105 HO 

Ser Pro Val Pro Gly Leu Thar His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 

Leu Val Thr Asn Gin Cys Sear Val Tyr Cys Arg His Cys Thr Arg Arg 
130 13 5 140 

Arg Phe Ser Gly Gin He Gl^r Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 

Ala Ala He Ala Tyr He Arc/ Glu Thr Pro Glu lie Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Lena Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 
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Ile Leu Lys Glu Leu Arg Ser lie Pro His Leu Glu Val I3_e Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Pro 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser lie Glu Met Thr Glu Glu Ser Val Glu Ala C^s Glu Lys 
245 250 ~ 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 2^7 0 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly L^u Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Pro Gly Tyr Ala Val Poro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu (3- In Pro Asn 
340 345 3 50 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Ser Pro Glu Thr Ala A,sp Lys Lys 
385 390 395 " 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 ^30 
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Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Ser Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 9 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct 



<400> 9 
atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagtt 


atggaaggac 


60 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtgcgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tgtttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tt*tggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 
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aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 


10 


<211> 


471 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


10 



Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
1 5 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 3 0 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr He 
5 0 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 * 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 

His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 no 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
13 0 13 5 140 
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Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu lie Arg Asp Cys Leu 
165 170 175 



lie Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin Tie Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val lie Arg lie 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr Val Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 
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Gly Val lie Thr Ser Tyr Pro Glu Pro Glu Asn Tyr lie Fro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala -Asp Lys Lys 
385 390 395 400 



Glu Pro lie Gly Leu Ser Ala lie Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg lie Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 11 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 11 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgacjtt 


atggaaggac 


60 


gtcccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacacactgt 


aggaacgtta 


120 


gatgatttaa 


agaaagtcat 


caatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ctttaat^ga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaag 


gttccgtgta 


ctgccgccac 


420 


cgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgttt aat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatccgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtcttccc 


gcagcgcatt 


660 
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accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaact 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggcca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 12 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 12 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
1 "* 5 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Gly Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 ' 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 
50 ~ 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 
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Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Le^u Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu Hd_s Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Gly Ser Val Tyr Cys Arg Hi_ s Arg Thr Arg Arg 
13 0 13 5 14 0 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Ly-s Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu lie Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu lie Asn Asp Gin lie Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Pro Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg lie Thr Asp His Leu 
210 215 22 0 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu. Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala. Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro Thr Met Lys Lys Leu. Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 
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Ile Glu Gly Leu Arg Gly His Thr Ser Gl^y Tyr Ala Val Pro Thr Phe 
325 33 0 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Va_l lie Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Gl_u Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Prro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Aztrg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Aztrg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 



465 


470 


<210> 


13 


<211> 


1416 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


13 



atgaaaaaca aatggtataa accgaaacgg cattgggagg agatcgagcg atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 12 0 

gatgatttaa agaaagtcat taatctgacc gagcjatgaag aggaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tattacacct tacfcatgctt ccttaatgga ccccgacaat 240 
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ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgea 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


gggatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagee 


gcgcagcact 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


geagegcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatcegg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


gtgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcattcc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgeggtte 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagfcgafcc t 


taagaaattt 


fc era a cr cr t crt cr 


at1"a cert" cp* f- 


a l. ^ — y cicil- 




*i i a n 
_L J.4LU 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttget 


gacaaagaag 


tttegtctae 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


ategcaaate 


eggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 14 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 14 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Glu Glu lie Glu 
15 10 15 



Arg Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 
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Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 ~ 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu lie Arg Asp Cys Leu 
165 170 ~ 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Pro Arg Ser Thr Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 
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Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 2 65 270 



Gly lie Asn Asp Ser Val Pro lie V*al Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr T;yr lie Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly lie Arg His Ser Arg Ala Pro Val Ser Lys Gly Leu Glu lie 
305 310 315 320 



lie Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp L^s Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Prie Pro Glu Thr Ala Asp Lys Lys 
385 390 395 ' 400 



Glu Pro He Gly Leu Ser Ala He Ptie Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He L^s Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 15 
<211> 1416 
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<212> UNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 15 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattgggagg 


agatcgagcg 


atggaaggac 


6 0 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacacactgt 


aagaacgtta 


12 0 


gatgatttaa 


agaaagtcat 


taatcz tgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


18 0 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ccttaatgga 


ccccgacaat 


24 0 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgca 


caaaacaaaa 


30 0 


tacgatatgg 


aagacccgct 


tcatcjaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


3G 0 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 


42 0 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


gggatgggcg 


tccccaaaaa 


acagcttgat 


4S0 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


54=0 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagcc 


gcgcagcact 


6O0 


ccgcatctgg 


aagtcatccg 


catccjgaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


7S0 


ggagtgccgg 


tcggaaatca 


ggctg-tcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


gtgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


9O0 


tgtgatctgt 


cagaaggaat 


aaggcattcc 


cgtgctcctg 


1 1 tccaaaacr 






attgaagggc 


tgagaggtca 


tacc -fccaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


10S0 


aaagtgatct 


taagaaattt 


tgaa-orgtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctat tttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


12O0 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


12e50 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


agger gattct 


tcataa 






14H6 



<210> 16 
<211> 471 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3> Synthetic Construct 
<400> 16 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Glu Glu lie Glu 
15 10 15 



Arg Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Pro Arg Ser Thr Pro His Leu Glu Val He Arg He 
195 200 205 
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Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg lie Thr Asp His Leu 
210 215 220 



Cys Glu lie Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Val Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Ser Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 ~ 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 
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Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gl_n 
435 440 445 

Leu Lys GDLu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Trar 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 17 

<211> 1416 

<212> DN^\ 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct 



<400> 17 
atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagtt 


atggaag-gac 


60 


gttccggaacg 


agaaatggaa 


cgattggctt 


tgacggctga 


cacacactgt 


aagaacgtta 


120 


gatgatttaei 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctc 


ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttccg 


aagaaatgca 


caaaacaaaa 


300 


tacgatatgg; 


aagacccgct 


tcatgaggat 


gaagatacac 


cggtacccgg 


tccgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gctccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattcg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


540 


gatgggctgcc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg- 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttcca_att 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggag-atc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 
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atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 


18 


<211> 


471 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


18 



Met Lys Asn Lys Trp T^r Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Arg 
2 0 25 3 0 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Pro Leu Met Asp Pro Asp Asn 
65 7 0 75 ~ 8 0 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Thr Pro Val Pro Gly Pro Thr His Arg Tyr Pro Asp Arg Val Leu Pine 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
13 0 13 5 140 
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Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp C^s Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He A.rg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp HCis Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 2 55 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro T*lir Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 
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Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 4O0 



Glu Pro lie Gly Leu Ser Ala lie Phe Ala Asp Lys Glu Val Ser Sesr 
405 410 415 



Thr Pro Glu Asn Val Asp Arg lie Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Trir 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 19 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> Synthetic Construct 










<400> 19 
atggaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagtt 


atggaag-gac 


60 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggoggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 
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aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


g-gtgaaccg-cg 


780 


cr cr a cr fc cr c c cr cr 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


ctgagggctt 


ggggcatttc 


cgtgctcctg 


tttccaaagg 


tttggaga.tc 


960 


attcraacrcrcrc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacQca 


1020 


ccacrcrccrcracr 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtcaca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtttac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagaga 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 20 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 20 

Met Glu Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu. 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin. 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 
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His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin Xle Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu. Thr Pro Glu lie Arg Asp Cys Leu 
165 170 175 



lie Ser Gly Gly Asp Gly Leu Leu lie Asn Asp Gin Xle Leu Glu Tyr 
180 185 190 



lie Leu Lys Glu Leu Arg Ser lie Pro His Leu Glu Val lie Arg lie 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg lie Thr Asp His Leu 
210 215 220 



Cys Glu lie Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly Leu Gly His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 
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Ile Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala -Asp Lys Lys 
385 390 395 400 



Glu Pro lie Gly Leu Ser Ala He Phe Ala Asp Lys Glu "Val Ser Phe 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Asp Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 21 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 21 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagfct 


atggaaggac 


60 


gtcccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgagg 


aggaaggccjt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


taccatgctt 


ctttaatgcja 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgoa 


caaaacaaaa 


300 
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tacgacatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tccgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggctcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcgtt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatctt 


720 


aacacaagca 


tcgaaatgac 


agaagaaccc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcgggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


tgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggagcc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 22 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 22 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Turp Gin 
20 25 30 
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Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Va 1 lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyr His Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser GItjl Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 11 0 



Ser Pro Val Pro Gly Pro Thr His Arg Tyr Pro Asp Arg Va_l Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Ttir Arg Arg 
130 135 140 



Leu Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu. Glu Tyr 
180 185 19 0 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg Val Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Th_r His Leu 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Pro Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 27 0 
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Gly lie Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Cys Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 23 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic Construct 
<400> 23 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgag-tt 


atggaaagac 


60 


gttccggacg 


aaaagtggaa 


cgattggctt 


tgacagctga 


cacacactgt 


aagaacgtta 


120 


gatgattcaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccactttctg 


aagaaatgoa 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


gccgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgca 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccgaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgatfcc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaacjg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttcgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaacccjc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


ggcaaagaag 


tttcgtcteic 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatcja 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 24 

<211> 471 

<212> PRT 

<213> Artificial Secjuence 
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<220> 

<223> Synthetic Construct 
<400> 24 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Tirp Lys Glu lie Glu 
1 ~ 5 10 15 



Leu Trp Lys Asp Val Pro Asp Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Ser Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro GILy Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val His Cys Arg H±s Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro GILu Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



lie Ser Gly Gly Asp Gly Leu Leu lie Asn Asp Gin lie Leu Glu Tyr 
180 185 190 



lie Leu Lys Glu Leu Arg Ser lie Pro His Leu Glu Val lie Arg lie 
195 200 205 
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Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg lie Thr A.sp His Leu 
210 215 220 



Cys Glu lie Leu Lys Lys Tyr His Pro Val Trp Leu Asn rhr His Phe 
225 230 235 240 



Asn Thr Ser lie Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys A^sp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu lie 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 3 50 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg A.sn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala A>sp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Gly Lys Glu 7al Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala T^yr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys A.zrg Gly Gin 
435 440 445 
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Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys GZLu Thx 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 



<210> 25 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic Construct 
<400> 25 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atg-gaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aag-aacgttg 12 0 

gatgatttaa agaaagtcat taacctgacc gaggatgaag aggaaggcgt ccg-tatttct 18 0 

accaaaacga tccccttaaa tattacacct tactatgctt ctttaatgga ccccgacaaa 240 

ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aagaaatgca caaaacaaaa 3 00 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 3 60 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 42 0 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 60 0 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 

accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 72 0 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 7 80 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc ggttccaatt 840 

atgaaaaagc tcatgcatga cttggtaaaa atcagagtcc gtccttatta tatttaccaa 900 

tgtgacctgt cagaaggaat aaggcatttc cgtgctcctg tttc.caaagg tttggggatc 960 

attgaagggc tgggaggtca tacctcaggc tatgcggttc ctacctttgt cgttcacgca 1020 

ccaggcggag gaggtaaaat cgccctgcgg ccgaactatg tcctgtctca aagtcctgac 1080 

aaagtgatct taagaaattt tgaaggtgtg attacgtcat atccggaacc agagaattat 1140 

atccccaatc aggcagacgc ctattttgag tccgttttcc ctgaaaccgc tgacaagaag 1200 
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gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 26 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 26 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Lys 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 
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Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu lie Arg Asp Cys Leu 
165 170 175 



lie Ser Gly Gly Asp Gly Leu Leu lie Asn Asp Gin lie Leu Glu Tyr 
180 185 190 



lie Leu Lys Glu Leu Arg Ser lie Pro His Leu Glu Val lie Arg lie 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg lie Thr Asp His Leu 
210 215 220 



Cys Glu lie Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser lie Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr Tyr lie Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly lie Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Gly lie 
305 310 315 320 



He Glu Gly Leu Gly Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Arg Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 
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Ala Asp Ala Tyr Phe Glu Ser Val E>lie Pro Glu Thr Ala Asp Lys Lys 
385 ~" 390 395 400 

Glu Pro lie Gly Leu Ser Ala He P>he Ala Asp Lys Glu Val Ser Ser 
405 410 415 

Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys i^sp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 



<210> 27 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 27 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 

gttccgggag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 12 0 

gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 18 0 

accaaaacga tccccttaaa tattacacct tgctatgctc ctttaatgga ccccgacaac 240 

ccgagatgcc cggtacgcat gcagtctgtg ccgctttctg aagaaatgca caaaacaaaa 3 00 

tacgatatgg aagacccgct tcgtgaggat gaagattcac cggtacccgg tctgacacac 3 60 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 42 0 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 
gctgcaattg cttatatccg ggaaacaccc - gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacgg ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 

accgatcatc tgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 72 0 

aacacaagcg tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 7 80 



WO 2006/047589 



PCT/US2005/038552 



-44- 



ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgeggtte 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


ggggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


Quay i — ci ci i L- 
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d LttyyaaCL 


agagaat tat 


1 1 yi n 

114U 


atccccaatc 


aggcagaege 


ctattttgag 


tccgttttcc 


ctggaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttget 


gacaaagaag 


tttegtctae 


acctgaaaat 


1260 


gtagacagaa 


teaaaeggeg 


tgaggcatac 


ategcaaate 


eggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


etttggegea 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgegg 


aggggattct 


tcataa 






1416 



<210> 28 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 28 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Gly Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr lie 
50 55 60 

Pro Leu Asn He Thr Pro Cys Tyr 2Vla Pro Leu Met Asp Pro Asp Asn 
65 7 0 75 8 0 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 
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His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu Arg Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu lie Arg Asp Cys Leu 
165 170 175 



lie Ser Gly Gly Asp Gly Leu Leu lie Asn Gly Gin lie Leu Glu Tyr 
180 185 190 



lie Leu Lys Glu Leu Arg Ser lie Pro His Leu Glu Val lie Arg lie 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg lie Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser Val Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 
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Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val lie Leu Arg Asn Phe Glu 
355 360 365 



Gly Val lie Thr Ser Tyr Pro Glu Pro Glu Asn Tyr lie Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Gly Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro lie Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Ser Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 29 

<211> 1416 

<212> DNA- 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 29 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagtt 


atggaaggac 


60 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacggctga 


cacacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaag 


tattacacct 


tactatgctt 


ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aggaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 
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cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccgc 


420 


tgcacacgcc 


ggcgcttttc 


cggacagatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


fctcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


Qgtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


g-gttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cctggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


acggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


=a sa rrf" /r^a 4— ri 4- 

aaay LyaLC l 


4~ a s /~r — ^ ~> — » 4— 4— 4— 

LaayaaatLU 


uyadyy ug ucj 


a u Lacy tcdu 


at ccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 30 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<2*20> 

<223> Synthetic Construct 
<400> 30 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Arg 
20 25 30 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 
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Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 
50 55 60 



Pro Leu Ser lie Ttar Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro GZLy Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg Arg Cys, Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu lie Arg Asp Cys Leu 
165 170 175 



lie Ser Gly Gly Asp Gly Leu. Leu lie Asn Asp Gin lie Leu Glu Tyr 
180 185 190 



lie Leu Lys Glu Leu Arg Ser lie Pro His Leu Glu Val lie Arg lie 
195 200 205 



Gly Thr Arg Ala Pxro Val Val Phe Pro Gin Arg lie Thr Asp His Leu 
210 215 220 



Cys Glu lie Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 
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Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr Tyr lie Tyr Gin Cys -Asp Leu Ser 
290 295 300 



Glu Gly lie Arg His Phe Arg Ala Pro Val Ser Lys Gly Xeu Glu lie 
305 310 315 320 



lie Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val lie Leu Arg; Asn Phe Glu 
355 360 365 



Gly Val lie Thr Ser Tyr Pro Glu Pro Glu Asn Tyr lie IPro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro lie Gly Leu Ser Ala lie Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 -430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin 3Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 31 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<22 3> Synthetic Construct 
<400> 31 



atgaaaaaca 


aatggtataa 


accgaaaegg 


cattggaagg 


agatcgagtt 


atggaaggjac 


60 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacgcactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggegt 


ccgtattfcct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgega 


gcttaatgga 


tccagaaaac 


240 


ccacgttgtc 


eggtaegcat 


gcagtctgtg 


ccgctttctg 


aagaaatgea 


cacaagcaaa 


300 


tatgacatgg 


aagatccget 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgagtcaat 


gtcccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggegctttte 


eggacaaate 


ggaatgggcg 


tccccaaaaa 


acagcttcjat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggccjgt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gegcagcatt 


600 


ccgcatctgg 


gagtcatccg 


categgaaca 


cgtgctcccg 


tcgtctttcc 


geagegcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaga 


tatcatcegg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaaccjcg 


780 


ggagtgccgg 


teggaaatea 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


teatgeatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgeggtte 


ctacctttgt 


cgttcaccjca 


1020 


rpa erne cm Pi rr 


Vwj y v— j I — - C_-i- C_^LCJ_Ci L- 


p ctpp p t* cr p a cr 


pp eras pt a t~cr 


t p p t crt p t~ p a 




J- U O \J 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attaegtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagaege 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagecgateg 


ggctgagtgc 


catttttget 


gacaaagaag 


tttegtctae 


acctgaaaat 


1260 


gtagacagaa 


teaaaeggeg 


tgaggcatac 


ategcaaate 


eggagcatga 


aacattaaaa 


1320 


gateggegtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgegg 


aggggattct 


tcataa 






1416 



<210> 32 

<211> 471 

<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> Synthetic Construct 
<400> 32 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu. 
1 5 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 



Leu Thr Arg Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr Il« 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Glu Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Thr Ser Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Ased 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Ser Gin Cys Pro Val Tyr Cys Arg His Cys Thr Arg Arg- 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asrp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu He Arg Asp Cys Leu. 

165 170 175 



He Ser Gly Gly Asp Gly Leu Leu lie Asn Asp Gin He Leu Glu Ty:rr 
180 185 190 



lie Leu Lys Glu Leu Arg Ser He Pro His Leu Gly Val He Arg XU 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu. 
210 215 220 
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Cys Glu IILe Leu Lys Arg Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr S«sr lie Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
2^75 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr lie Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly lie Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu lie 
305 310 315 320 



lie Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Lou Ser Gin Ser Pro Asp Lys Val lie Leu Arg Asn Phe Glu 
3 55 3 60 3 65 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr lie Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro IILe Gly Leu Ser Ala lie Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro GILu Asn Val Asp Arg lie Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 



Asn Pro GILu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 
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Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 33 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Synthetic Construct 



<400> 33 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagtt 


atggaaggac 


60 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacac actgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


gqaatggqcg 

;3 * ZJ ZJ ZD ZD 


tcccc aaaaa 


acacrc ttcxat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


actcjt ctgtt 


gtctggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtc tttcc 


gcagcgcatt 


660 


accgatcacc 


tgtgcgagat 


gttaaaaaaa 


tatcatccgg 


tctg-g-otgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaait^attc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtcct izatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacc-fcttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


gctgtttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 
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gtagacagaa tcaaacggcg tgaggcatac atcgcaaatc cggagcatga aacattaaaa 132 0 
gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgca gcagaaaaaa 13 8 0 
cagaaagaga ctgaatgcgg aggggattct tcataa 1416 



<210> 


34 


<211> 


471 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


34 



Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
1 5 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 3 0 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 
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Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg- Asp Cys Leu 
165 170 175 



Leu Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin lie Leu Glu Tyr 
180 185 190 



lie Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 20 5 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Ttxr Asp His Leu 
210 215 220 



Cys Glu Met Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Va_l Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Me*t His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cy^s Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys GH_y Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 " 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arrg Asn Phe Glu 
355 360 3S5 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr I3_e Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr A3- a Asp Lys Lys 
385 " 390 395 400 
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Glu Pro He Gly Leu Ser Ala Leu Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 

Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 



<210> 35 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 35 

atgaaaaaca aatggtataa accgaaacgg cattggaagg- agatcgagtt atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 12 0 

gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tatcacacct tactatgcga. gcttaatgga tccagaaaac 240 

ccacgttgtc cggtacgcat gcagtctgtg ccgcttcctg; aagaaatgca caaaacaaaa 3 00 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 3 60 

cgctatcccg accgtgtgct gtttcttgtc acggatcaat gttccgtgta ctgccgccac 42 0 

cgcacacgcc ggcgcttctc cggacaaatc ggaatgggccj tccccgaaaa acagcttgat 480 

gctgcaattg cttacatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt: taaaagagct gcgcagcatt 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctccccj tcgtctttcc gcagcgcatt 660 

accgatcatc tgtgcgagat attgaaaaaa catcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat atgaaaagct ggtgaacgcg 780 

ggagtgccgg tcggaaatca ggctgttgta ttagcaggta ttaatgattc ggttccaatt 840 
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ataaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgacctgt 


cagaaggaat 


aaggcatttc 


cgtg-ctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tafcg-cggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


c».ci.cJ.y l. y cl i — - 1 — 




Lyaayy Ly uy 




ai — ttyyaaLt 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgrttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cg-gagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


tt ttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 36 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 36 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
2 0 25 3 0 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Sear Thr Lys Thr lie 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met: Asp Pro Glu Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu. Pro Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 
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Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asp Gin Cys Ser Val Tyr Cys Arg His Arg Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Glu Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 ' 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 

Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 

Cys Glu He Leu Lys Lys His His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 

Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Tyr Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He He Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 

Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 ' 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 
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Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val lie Leu Arg Asn Phe Glu 
355 360 365 



Gly Val lie Thr Ser Tyr Pro Glu Pro Glu Asn Tyr lie Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro lie Gly Leu Ser Ala lie Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg lie Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu. Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 
<211> 
<212> 
<213> 


37 

1416 
DNA 

Artificial Sequence 








<220> 
<223> 


Synthetic Construct 








<400> 37 
atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagtt 


atggaaggac 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacacactgt 


aagaacgtta 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ctttaatgga 


ccccgacaat 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgca 


caaaacaaaa 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


cgctatccca 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 



60 
120 
180 
240 
300 
360 
420 
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tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


actgtctcjtt 


gtctggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gegcagcatt 


600 


ccgcatctgg 


aagtcattcg 


tatcggttct 


cgtgcgccag 


tcgtctt tec 


geagegcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctg-aa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


agggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtcaca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtttac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


eggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagaga 


tcagctcaaa 


gaaaagaaat 


ttttggccgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 38 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 38 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Gin lie Glu 
•1 5 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 

2 0 25 3 0 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 
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Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Tlir lie 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 9 0 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asn Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 

130 135 140 

Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 

145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



Leu Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leix Glu Val He Arg He 
195 200 205 



Gly Ser Arg Ala Pro Val Val Phe Pro Gin Argr He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val_ Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Glra Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 2 80 2 85 
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Val Lys lie Arg Val Arg Pro Tyr Tyr lie Tyr Gin Cys Asp Leu Ser 
290 295 300 

Glu Gly lie Gly His Phe Arg Ala Pro Val Ser Lys Gly Leu GZLu lie 
305 310 315 320 



lie Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Tltir Phe 
325 330 3 35 



Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pzro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Prie Glu 
3 55 3 60 3 65 

Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Lsn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp L*:ys Lys 

385 390 395 400 

Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Phe 

405 410 4= 15 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg TVsp Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 39 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



Synthetic Construct 
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<400> 39 
atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagg 


agatcgagtt 


atggaaggac 


60 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgctt 


ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccgctttctg 


aagaaatgca 


caaaacaaaa 


300 


tacgatatgg aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgca 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


t ccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcacctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcafc 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg- 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag gtggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggccigELcgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg ggctgagtgc 


catttttgct 


ggcaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagtcagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


-fctttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 40 

<211> 471 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct 
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<400> 40 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 ^ 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val His Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 
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Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 ~" 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Gly Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Val Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 
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Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 41 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 41 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattgga-agg 


agatcgagtt 


atggagggac 


60 


gtcccggaag 


agaaatggaa 


cgattggctt 


tgacagc tga 


cacacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatg-aag 


aggaaggegt 


cegtatttet 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatcj-ctt 


ctttaatgga 


ccccgacaat 


240 


ccgaggtgcc 


cggtacgcat 


gcagtctgtg 


ccactgtctg 


aggaaatgea 


caaaagcaaa 


300 


tatgacatgg 


aagatccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgcjgcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


540 


y aL yyy ^ uy 


\~ Cri^r p\ r* ?k 
i— ci t— . k — do. ^ y ci 


{—< (~1 =1 =2 ^3 if— f— f— => 

v — > — ex. exca. i — ( — i — i— a 


/~rzs 3 j~ a f" r^r; 4- j- 4- 
y CLCL I- a. L- d L- C L- 


L-cica.cia.yc2.yi-. t- 


y 09 uagcau u 


ouu 


ccgcatctgg 


aagtcatccg 


categgaaca 


cgtgctcccg 


tcgtctttcc 


geagegcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatcegg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgagg-cat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagca^gta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgegg-tte 


ctacctttgt 


cgttcacgca 


1020 


ccgggcggag 


gaggtaaaat 


cgccctgcag 


ccgaac *tatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacg~tcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttget 


gacaaagaag 


tttegtctae 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggegtae 


ategcaaate 


eggagcatga 


aacattaaaa 


1320 



WO 2006/047589 



PCT/US2005/038552 



-67- 



gatcggcgtg agaaaagagg tcagctcaaa gaaaagaaat ttttggcgcs. gcagaaaaaa 13 80 
cagaaagaga ctgaatgcgg aggggattct tcataa 1416 



<210> 


42 


<211> 


471 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


42 



Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 



Leu Trp Arg Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 3 0 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Ser Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 X10 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg; Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
13 0 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 "* 160 



Ala Ala He Ala Tyr lie Arg Glu Thr Pro Glu lie Arg Asp Cys Leu 
165 170 ^ 17 5 
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Ile Ser Gly Gly Asp Gly Leu Leu lie Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 ~ 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 " 400 
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Glu Pro lie Gly Leu Ser Ala lie Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 

Thr Pro Glu Asn Val Asp Arg lie Lys Arg Arg Glu Ala Tyr lie Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 

Glu Cys Gly Gly Asp Ser Ser 



<210> 43 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 43 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 60 

gttccggaag agaaatggaa cgattggctt tgacagctga cacacactgt aagaacgtta 12 0 

gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 

accaaaacga tccccttaaa tattacacca tactatgcga gcttaatgga tccagaaaac 240 

ccacgttgtc cggtacgcat gcagtctgtg ccgctttccg aagaaatgca caaaacaaaa 3 00 

tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg tctgacacac 360 

cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 42 0 

tgcacacgcc ggcgcttttc cggacaaatc ggaatgggcg tccccaaaaa acagcttgat 480 

gctgcaattg cttatatccg ggaaacaccc gaaatccgcg attgtttaat ttcaggcggt 540 

gatgggctgc tcatcaacga ccaaatttta gaatatattt taaaagagct gcgcagcatt 600 

ccgcatctgg aagtcatccg catcggaaca cgtgctcccg tcgtctttcc gcagcgcatt 660 

accgatcatc cgtgcgagat attgaaaaaa tatcatccgg tctggctgaa cacccatttt 720 

aacacaagca tcgaaatgac agaagaatcc gttgaggcat gtgaaaagct ggtgaacgcg 7 80 

ggagtgccgg tcggaaatca ggctgtcgta ttagcaggta ttaatgattc ggttccaatt 840 

atgaaaaagc tcatgcatga cttggtaaaa atcagagtcc gtccttatta tatttaccaa 900 
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tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tctccaaagg- 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


taccccaggc 


tatgeggtte 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctcea. 


aagtcctgac 


1080 


aaagtgatct 


taaaaaahth 

w <taX-V^L <L-4» ShkL t^l V— 


t era acrcr t cr1~ cr 


a. i — u-civ^y i— l^cl i— 




agagaattat 


±14 0 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgtttccc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttget 


gacaaagaag 


tttegtctae: 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggectae 


ategcaaate 


eggagcatga. 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttteggegea 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 44 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 44 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys GILu lie Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Lou Trp Gin 
20 25 3 O 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys V^al He Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr L*:ys Thr He 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp E>ro Glu Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser G-lu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu ^sp Glu Asp 
100 105 i_io 
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Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu. 

165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr~ 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg lie- 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Pro 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 ~ 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ale 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Senr 
290 295 300 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu Ilo 
305 310 315 32 0 



He Glu Gly Leu Arg Gly His Thr Pro Gly Tyr Ala Val Pro Thr Ph« 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asm 
340 345 350 
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Tyr Val Leu Ser Gin Ser Pro Asp Lys Val lie Leu Arg Asn Phe Glu 
355 360 365 



Gly Val lie Thr Ser Tyr Pro Glu Pro Glu Asn Tyr lie Pro -Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Ser Pro Glu Thr Ala Asp Lys Lys 
385 390 395 ^ 400 



Glu Pro lie Gly Leu Ser Ala lie Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Ar-g Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Ser Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 45 
<211> 1416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 45 

atgaaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt acggaaggac 60 
gttccggaag agaaatggaa cgattggctt tgacagctga cgcacactgt aagaacgtta 12 0 
gatgatttaa agaaagtcat taatctgacc gaggatgaag aggaaggcgt ccgtatttct 180 
accaaaacga tccccttaaa tattacacct tactatgcga gcttaattga tccagaaaac 240 
ccacgttgtc cggtacgcat gcagtctgcg ccgctgtctg aagaaatgca caaaacaaaa 300 
tacgatatgg aagacccgct tcatgaggat gaagattcac cggtacccgg; tctgacacac 3 60 
cgctatcccg accgtgtgct gtttcttgtc acgaatcaat gttccgtgta ctgccgccac 420 
tgcacacgcc ggcgcttttc cggacaaatc ggaacgggcg tccccaaaaa acagcttgat 480 
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gctgcaactg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


tccaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


ggatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgcccccg 


tcggctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tatgcggttc 


ctacctttgt 


cgttcacQca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


ccctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 46 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 46 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu. 
1 ~ 5 10 15 

Leu Arg Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Glxi 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asm 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr II o 
50 55 60 
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Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu He Asp Pro Glu Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Ala Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro "Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Thr Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala Thr Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Pro Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Gly Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Gly Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 ^ 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 
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Val Lys lie Arg Val Arg Pro Tyr Tyr He Tyr GUn Cys Asp Leu Ser 
290 295 3O0 



Glu Gly He Arg His Phe Arg Ala Pro Val Ser L^rs Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr AZLa Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He A Ha Leu Gin Pro Asn 
340 345 350 



Tyr Ala Leu Ser Gin Ser Pro Asp Lys Val He Lou Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr lie Pro Asn Gin 
370 375 3 80 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu rhr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp L.ys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys X_>ys Gin Lys Glu Thr 
450 455 ^60 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 47 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Synthetic Construct 



<400> 47 

atggaaaaca aatggtataa accgaaacgg cattggaagg agatcgagtt atggaaggac 



60 
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gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactatgcga 


gcttaattga 


tccagaaaac 


240 


ccacgttgtc 


cggtacgcat 


gcagtctgtg 


ccgctttccg 


aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagatccgct 


tcatgaggat 


gaagattcac 


cggtac ccgg 


cctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


gcgaatcaat 


gttccg-tgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg 


cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtfc taat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatattt 


taaaagagct 


gcgcagcatt 


600 


ccgcatccgg 


aagtcatccg 


catcggaaca 


cgtgcccccg 


tcgtct ttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctggctgaa 


cacccatttt 


720 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggcat 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggta 


ttaatcjattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtcc 


gtccthatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgcccctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


tacctcaggc 


tgtgcggttc 


ctacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtzctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtcat 


atccggaacc 


agagaattat 


1140 


atccccaacc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagggg 


tcagctcaaa 


gaaaagaaat 


ttttgcjcgca 


gcagaaaaaa 


1380 


cagaaagaga 


ctgaatgcgg 


aggggattct 


tcataa 






1416 



<210> 48 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 



<400> 48 
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Met Glu Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 3 0 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr lie 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu lie Asp Pro Glu Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg- Val Leu Phe 
115 120 125 



Leu Val Ala Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg .Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr He Arg Glu Thr Pro Glu He Arcj Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Pro Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thirr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 
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Asn Thr Ser lie Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 2 50 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly lie Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys lie Arg Val Arg Pro Tyr Tyr Xle Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly lie Arg His Phe Arg Ala Pro \7al Ser Lys Gly Leu Glu He 

305 310 315 320 

He Glu Gly Leu Arg Gly His Thr Ser Gly Cys Ala Val Pro Thr Phe 

325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly 3Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys "Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 " "* 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Ser 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 ~ 460 
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Glu Cys Gly Gly Asp Ser Ser 
465 470 

<210> 49 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 49 



atgaaaaaca 


aatggtataa 


accgaaacgg 


cattggaagcj 


agatcgagtt 


atggaaggac 


60 


gttccggaag 


agaaatggaa 


cgattggctt 


tgacagctga 


cacacactgt 


aagaacgtta 


120 


gatgatttaa 


agaaagtcat 


taatctgacc 


gaggatgaag 


aggaaggcgt 


ccgtatttct 


180 


accaaaacga 


tccccttaaa 


tattacacct 


tactaggttt 


ctttaatgga 


ccccgacaat 


240 


ccgagatgcc 


cggtacgcat 


gcagtctgtg 


ccactgtctcj 


aagaaatgca 


caaaacaaaa 


300 


tacgatatgg 


aagacccgct 


tcatgaggat 


gaagattcac 


cggtacccgg 


tctgacacac 


360 


cgctatcccg 


accgtgtgct 


gtttcttgtc 


acgaatcaat 


gttccgtgta 


ctgccgccac 


420 


tgcacacgcc 


ggcgcttttc 


cggacaaatc 


ggaatgggcg- 


tccccaaaaa 


acagcttgat 


480 


gctgcaattg cttatatccg 


ggaaacaccc 


gaaatccgcg 


attgtttaat 


ttcaggcggt 


540 


gatgggctgc 


tcatcaacga 


ccaaatttta 


gaatatatt-fc 


taaaagagct 


gcgcagcatt 


600 


ccgcatctgg 


aagtcatccg 


catcggaaca 


cgtgctcccg 


tcgtctttcc 


gcagcgcatt 


660 


accgatcatc 


tgtgcgagat 


attgaaaaaa 


tatcatccgg 


tctcrcrc tcraa 


v — - CJ. v — v^. v — ■ CL I— * \ L-- 


7 9 0 


aacacaagca 


tcgaaatgac 


agaagaatcc 


gttgaggca-t 


gtgaaaagct 


ggtgaacgcg 


780 


ggagtgccgg 


tcggaaatca 


ggctgtcgta 


ttagcaggt^ 


ttaatgattc 


ggttccaatt 


840 


atgaaaaagc 


tcatgcatga 


cttggtaaaa 


atcagagtc c 


gtccttatta 


tatttaccaa 


900 


tgtgatctgt 


cagaaggaat 


aaggcatttc 


cgtgctcctg 


tttccaaagg 


tttggagatc 


960 


attgaagggc 


tgagaggtca 


cacctcaggc 


aatgcggttc 


ccacctttgt 


cgttcacgca 


1020 


ccaggcggag 


gaggtaaaat 


cgccctgcag 


ccgaactatg 


tcctgtctca 


aagtcctgac 


1080 


aaagtgatct 


taagaaattt 


tgaaggtgtg 


attacgtca_t 


atccggaacc 


agagaattat 


1140 


atccccaatc 


aggcagacgc 


ctattttgag 


tccgttttcc 


ctgaaaccgc 


tgacaaaaag 


1200 


gagccgatcg 


ggctgagtgc 


catttttgct 


gacaaagaag 


tttcgtctac 


acctgaaaat 


1260 


gtagacagaa 


tcaaacggcg 


tgaggcatac 


atcgcaaatc 


cggagcatga 


aacattaaaa 


1320 


gatcggcgtg 


agaaaagagg 


tcagctcaaa 


gaaaagaaat 


ttttggcgca 


gcagaaaaaa 


1380 
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cagaaagaga ctgaatgcgg aggggattct tcataa 1416 

<210> 50 
<211> 71 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 50 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thr Lys Thr He 
50 55 60 

Pro Leu Asn He Thr Pro Tyr 



65 


70 


<210> 


51 


<211> 


399 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


51 



Val Ser Leu Met Asp Pro Asp Asn Pro Arg Cys Pro Val Arg Met Gin 
1 5 10 15 



Ser Val Pro Leu Ser Glu Glu Met His Lys Thr Lys Tyr Asp Met Glu 
20 25 ' 30 



Asp Pro Leu His Glu Asp Glu Asp Ser Pro Val Pro Gly Leu Thr His 
35 40 45 



Arg Tyr Pro Asp Arg Val Leu Phe Leu Val Thr Asn Gin Cys Ser Val 
50 55 60 
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Tyr Cys Arg His Cys Thr Arg Arg Arg Phe Ser Gly Gin lie Gly Met 
65 70 75 80 



Gly Val Pro Lys Lys Gin Leu Asp Ala Ala lie Ala Tyr lie Arg Glu 
85 90 95 



Thr Pro Glu lie Arg Asp Cys Leu lie Ser Gly Gly Asp Gly Leu Leu 
100 105 110 



lie Asn Asp Gin lie Leu Glu Tyr lie Leu Lys Glu Leu Arg Ser lie 
115 120 125 



Pro His Leu Glu Val lie Arg lie Gly Thr Arg Ala Pro Val Val Phe 
130 135 140 



Pro Gin Arg lie Thr Asp His Leu Cys Glu lie Leu Lys Lys Tyr His 
145 150 155 160 



Pro Val Trp Leu Asn Thr His Phe Asn Thr Ser lie Glu Met Thr Glu 
165 170 175 



Glu Ser Val Glu Ala Cys Glu Lys Leu Val Asn Ala Gly Val Pro Val 
180 185 190 



Gly Asn Gin Ala Val Val Leu Ala Gly lie Asn Asp Ser Val Pro lie 
195 200 205 



Met Lys Lys Leu Met His Asp Leu Val Lys He Arg Val Arg Pro Tyr 
210 215 220 



Tyr He Tyr Gin Cys Asp Leu Ser Glu Gly lie Arg His Phe Arg Ala 
225 ^ 230 235 240 



Pro Val Ser Lys Gly Leu Glu lie lie Glu Gly -Leu Arg Gly His Thr 
245 250 255 



Ser Gly Asn Ala Val Pro Thr Phe Val Val His Ala Pro Gly Gly Gly 
260 265 270 



Gly Lys lie Ala Leu Gin Pro Asn Tyr Val Leu Ser Gin Ser Pro Asp 
275 280 285 
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Lys Val He Leu Arg Asn Phe Glu Gly Val He Thr Ser Tyr Pro Glu 
290 295 300 



Pro Glu Asn Tyr He Pro Asn Gin Ala Asp Ala Tyr Phe Glu Ser Val 
305 310 315 320 



Phe Pro Glu Thr Ala Asp Lys Lys Glu Pro He Gly Leu Ser Ala He 
325 330 335 



Phe Ala Asp Lys Glu Val Ser Ser Thr Pro Glu Asn Val Asp Arg He 
340 345 350 



Lys Arg Arg Glu Ala Tyr He Ala Asn Pro Glu His Glu Thr Leu Lys 
355 360 365 



Asp Arg Arg Glu Lys Arg Gly Gin Leu Lys Glu Lys Lys Phe Leu Ala 
370 375 380 



Gin Gin Lys Lys Gin Lys Glu Thr Glu Cys Gly Gly Asp Ser Ser 
385 390 395 



<210> 52 

<211> 1245 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<220> 

<221> mis cofeature 

<223> This parental sequence is a modification of the wild-type KAM of 
Clostridium stricklandii 

<220> 

<221> CDS 

<222> (1) . . (1245) 

<400> 52 

atg agt tta aag gat aag ttt ttt aca cat gta age caa gaa gat tgg 48 
Met Ser Leu Lys Asp Lys Phe Phe Thr His Val Ser Gin Glu Asp Trp 
15 10 15 

aat gat tgg aaa tgg caa gta aga aat cgt ata aag act gtt gaa gaa 9 6 

Asn Asp Trp Lys Trp Gin Val Arg Asn Arg He Lys Thr Val Glu Glu 
20 25 30 

ctt aaa aaa tat att cca ctt act cca gaa gaa gaa gaa ggg gta aaa 144 
Leu Lys Lys Tyr He Pro Leu Thr Pro Glu Glu Glu Glu Gly Val Lys 
35 40 45 
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cgc tgt ctt gat aca tta cgt atg get att act cca tac tat eta teg 192 
Arg Cys Leu Asp Thr Leu Arg Met Ala lie Thr Pro Tyr Tyr Leu Ser 
5 0 5 5 60 

eta att gat gta gaa aat cca aat gac cct gta aga aag caa get gta 240 
Leu He Asp Val Glu Asn Pro Asn Asp Pro Val Arg Lys Gin Ala Val 
65 70 75 80 

cct ctt tct tta gag ctg cat cgc gca gcg tct gat atg gaa gac cca 2 88 

Pro Leu Ser Leu Glu Leu His Arg Ala Ala Ser Asp Met Glu Asp Pro 
85 90 95 

ctt cat gaa gat gga gat tct cca gtt cca gga ctt aca cat cgc tat 33 6 

Leu His Glu Asp Gly Asp Ser Pro Val Pro Gly Leu Thr His Arg- Tyr 
100 105 110 

cct gat cgc gtt ctt ctt tta atg act gat caa tgt tea gta tac tgc 3 84 

Pro Asp Arg Val Leu Leu Leu Met Thr Asp Gin Cys Ser Val Tyr Cys 
115 12 0 12 5 

cgc cac tgt act cgt aga cgc ttc get ggt cga aca gat tct get gtt 432 
Arg His Cys Thr Arg Arg Arg Phe Ala Gly Arg Thr Asp Ser Ala. Val 
130 135 140 

gat acg aag caa ata gat get gcg att gaa tat ate aaa aat act cca 480 
Asp Thr Lys Gin He Asp Ala Ala He Glu Tyr He Lys Asn Thir Pro 
145 150 155 160 

caa gta aga gac gtt eta ctt tea gga gga gat get eta tta ate tea 52 8 

Gin Val Arg Asp Val Leu Leu Ser Gly Gly Asp Ala Leu Leu lie Ser 
165 170 175 

gat gaa aag ctt gag tac aca ate aga aga ctt cgt gaa ata cca. cac 57 6 

Asp Glu Lys Leu Glu Tyr Thr He Arg Arg Leu Arg Glu He Pro His 
180 185 190 

gtt gag gtt att cgt att gga tea cgt gta cca gtt gta atg cca. caa 624 
Val Glu Val He Arg He Gly Ser Arg Val Pro Val Val Met Pro Gin 
195 200 205 

cgt att aca cca gaa eta gtt tct atg ctt aaa aag tat cat cca. gta 672 
Arg He Thr Pro Glu Leu Val Ser Met Leu Lys Lys Tyr His Pro Val 
210 215 * 220 

tgg tta aat aca cac ttc aac cat cct aat gaa att act gaa gag tct 72 0 

Trp Leu Asn Thr His Phe Asn His Pro Asn Glu He Thr Glu Glu. Ser 
225 230 235 240 

aaa cgt gca tgt gag tta ctt get gat gca ggt att cct ctt gga. aat 7 68 

Lys Arg Ala Cys Glu Leu Leu Ala Asp Ala Gly He Pro Leu Gly Asn 
245 250 255 

caa agt gtg ctt ctt gca ggt gta aat gat tgc atg cac gtt atgg aaa 816 
Gin Ser Val Leu Leu Ala Gly Val Asn Asp Cys Met His Val Met Lys 
260 265 270 



aaa eta gta aat gac tta gtt aaa ata cgc gta cgt cct tac fcafc att 



864 
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Lys Leu Val Asn Asp Leu Val Lys lie Arg Val Arg Pro Tyr Tyr lie 
275 280 285 

tat caa tgt gac ctt tea gtt gga att gag cac ttt cgc act cca gtt 912 
Tyr Gin Cys Asp Leu Ser Val Gly lie Glu His Phe Arg Thr Pro Val 
290 295 300 

gca aag gga ata gaa ata att gaa ggc tta aga gga cat act tea gga 9 60 

Ala Lys Gly lie Glu lie lie Glu Gly Leu Arg Gly His Thr Ser Gly 
305 310 315 320 

tac tgc gtt cct aca ttt gtt gtg cat gca cct ggt ggt gga gga aaa 1008 
Tyr Cys Val Pro Thr Phe Val Val His Ala Pro Gly Gly Gly Gly Lys 
325 330 335 

act cca gtt atg cca aac tat gtt att tea caa aat cac aat aaa gtt 1056 
Thr Pro Val Met Pro Asn Tyr Val lie Ser Gin Asn His Asn Lys Val 
340 345 350 

att tta cgt aac ttt gaa ggt gta att aca act tac gat gag cct gat 1104 
lie Leu Arg Asn Phe Glu Gly Val lie Thr Thr Tyr Asp Glu Pro Asp 
355 360 365 

cat tat act ttc cac tgt gac tgt gat gta tgc act gga aaa aca aat 1152 
His Tyr Thr Phe His Cys Asp Cys Asp Val Cys Thr Gly Lys Thr Asn 
370 375 380 

gtt cat aag gtt gga gta get gga ctt eta aat gga gag aca gcg aca 12 00 

Val His Lys Val Gly Val Ala Gly Leu Leu Asn Gly Glu Thr Ala Thr 
385 390 395 400 

ctt gaa cct gag ggt ttg gaa aga aaa caa aga gga cat cac taa 1245 
Leu Glu Pro Glu Gly Leu Glu Arg Lys Gin Arg Gly His His 
405 410 



<210> 53 
<211> 414 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 53 

Met Ser Leu Lys Asp Lys Phe Phe Thr His Val Ser Gin Glu Asp Trp 
15 10 15 



Asn Asp Trp Lys Trp Gin Val Arg Asn Arg lie Lys Thr Val Glu Glu 
20 25 30 



Leu Lys Lys Tyr lie Pro Leu Thr Pro Glu Glu Glu Glu Gly Val Lys 
35 40 45 
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Arg Cys Leu Asp Thr Leu Arg Met Ala He Thr Pro Tyr Tyr Leu Ser 
50 55 60 



Leu He Asp Val Glu Asn Pro Asn Asp Pro Val Arg Lys Gin Ala Val 
65 70 75 ~ 80 



Pro Leu Ser Leu Glu Leu His Arg Ala Ala Ser Asp Met Glu Asp Pro 
85 90 95 



Leu His Glu Asp Gly Asp Ser Pro Val Pro Gly Leu Thr His Arg Tyr 
100 105 110 



Pro Asp Arg Val Leu Leu Leu Met Thr Asp Gin Cys Ser Val Tyr Cys 
115 12 0 125 



Arg His Cys Thr Arg Arg Arg Phe Ala Gly Arg Thr Asp Ser Ala Val 
130 135 140 



Asp Thr Lys Gin He Asp Ala Ala He Glu Tyr He Lys Asn Thr Pro 
145 150 155 160 



Gin Val Arg Asp Val Leu Leu Ser Gly Gly Asp Ala Leu Leu He Ser 
165 170 175 



Asp Glu Lys Leu Glu Tyr Thr He Arg Arg Leu Arg Glu He Pro His 
180 185 190 



Val Glu Val He Arg He Gly Ser Arg Val Pro Val Val Met Pro Gin 
195 200 205 



Arg He Thr Pro Glu Leu Val Ser Met Leu Lys Lys Tyr His Pro Val 
210 215 220 



Trp Leu Asn Thr His Phe Asn His Pro Asn Glu He Thr Glu Glu Ser 
225 230 235 240 



Lys Arg Ala Cys Glu Leu Leu Ala Asp Ala Gly He Pro Leu Gly Asn 
245 250 255 



Gin Ser Val Leu Leu Ala Gly Val Asn Asp Cys Met His Val Met Lys 
260 265 270 



Lys Leu Val Asn Asp Leu Val Lys He Arg Val Arg Pro Tyr Tyr He 
275 280 285 
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Tyr Gin Cys Asp L<su Ser Val Gly lie Glu His Phe Arg Thr Pro Val 
290 295 300 



Ala Lys Gly lie Glu He He Glu Gly Leu Arg Gly His Thr Ser Gly 
305 310 315 320 



Tyr Cys Val Pro Trir Phe Val Val His Ala Pro Gly Gly Gly Gly Lys 
3 25 330 335 



Thr Pro Val Met Pro Asn Tyr Val He Ser Gin Asn His Asn Lys Val 
340 345 350 



lie Leu Arg Asn Phe Glu Gly Val He Thr Thr Tyr Asp Glu Pro Asp 
355 360 365 



His Tyr Thr Phe His Cys Asp Cys Asp Val Cys Thr Gly Lys Thr Asn 
370 375 380 



Val His Lys Val Gly Val Ala Gly Leu Leu Asn Gly Glu Thr Ala Thr 
385 390 395 400 



Leu Glu Pro Glu Gly Leu Glu Arg Lys Gin Arg Gly His His 
405 410 



<210> 54 

<211> 1251 

<212> DNA 

<213> Artificial! Sequence 
<220> 

<223> Synthetic Construct 

<220> 

<221> CDS 

<222> (1)..(1251) 

<400> 54 

atg gca gaa agt cgt aga aag tat tat ttc cct gat gtc acc gat gag 48 

Met Ala Glu Ser Arg Arg Lys Tyr Tyr Phe Pro Asp Val Thr Asp Glu 
15 10 15 

caa tgg tac gac tgg cat tgg cag gtc etc aat cga att aag acg etc 96 
Gin Trp Tyr Asp Trp His Trp Gin Val Leu Asn Arg He Lys Thr Leu 
20 25 30 



gac cag ctg aaa aag tac gtt aca etc acc get gaa gaa gaa gag gga 
Asp Gin Leu Lys Lys Tyr Val Thr Leu Thr Ala Glu Glu Glu Glu Gly 



144 
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35 40 45 

gta aaa gaa teg ccc aaa gta etc cga atg get ate aca cct tat tat 192 
Val Lys Glu Ser Pro Lys Val Leu Arg Met Ala He Thr Pro Tyr Tyr 
50 55 60 

ttg agt ttg ata gac ccc gag aat cct aat tgt ccg att cgt aaa caa 240 
Leu Ser Leu He Asp Pro Glu Asn Pro Asn Cys Pro He Arg Lys Gin 
65 70 75 80 

gec att cct act caa cag gaa ctg gta cgt get cct gaa gat cag gta 2 88 

Ala He Pro Thr Gin Gin Glu Leu Val Arg Ala Pro Glu Asp Gin Val 
85 90 95 

gac cca ctt agt gaa gat gaa gat teg ccc gta ccc gga ctg act cat 33 6 

Asp Pro Leu Ser Glu Asp Glu Asp Ser Pro Val Pro Gly Leu Thr His 
100 105 110 

cgt tat ccg gat cgt gta ttg ttc ctt ate acg gac aaa tgt teg atg 3 84 

Arg Tyr Pro Asp Arg Val Leu Phe Leu He Thr Asp Lys Cys Ser Met 
115 120 125 

tac tgt cgt cat tgt act cgc cgt cgc ttc gca gga cag aaa gat get 432 
Tyr Cys Arg His Cys Thr Arg Arg Arg Phe Ala Gly Gin Lys Asp Ala 
130 135 140 

tct tct cct tct gag- cgc ate gat cga tgc att gac tat ata gec aat 480 
Ser Ser Pro Ser Glu Arg He Asp Arg Cys lie Asp Tyr He Ala Asn 
145 150 155 160 

aca ccg aca gtc cgc gat gtt ttg eta teg gga ggc gat gec etc ctt 52 8 

Thr Pro Thr Val Arg- Asp Val Leu Leu Ser Gly Gly Asp Ala Leu Leu 
165 170 175 

gtc age gac gaa cgc ttg gaa tac ata ttg aag cgt ctg cgc gaa gta 57 6 

Val Ser Asp Glu Arc/ Leu Glu Tyr He Leu Lys Arg Leu Arg Glu Val 
180 185 190 

cct cat gtg gag att gtt cgt ata gga age cgt acg ccg gta gtc etc 624 
Pro His Val Glu' He Val Arg He Gly Ser Arg Thr Pro Val Val Leu 
195 200 205 

cct cag cgt ata acg- cct caa ttg gtg gat atg etc aaa aaa tat cat 672 
Pro Gin Arg He Thr- Pro Gin Leu Val Asp Met Leu Lys Lys Tyr His 
210 215 220 

ccg gtg tgg ctg a act act cac ttc aac cac ccg aat gaa gtt acc gaa 72 0 

Pro Val Trp Leu Asn Thr His Phe Asn His Pro Asn Glu Val Thr Glu 
225 230 235 240 

gaa gca gtg gag get tgt gaa aga atg gec aat gec ggt att ccg ttg 7 68 

Glu Ala Val Glu Ala. Cys Glu Arg Met Ala Asn Ala Gly He Pro Leu 
245 250 255 

ggt aac caa acg gtt tta ttg cgt gga ate aat gat tgt aca cat gtg 816 
Gly Asn Gin Thr Val Leu Leu Arg Gly He Asn Asp Cys Thr His Val 
260 265 270 
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atg aag aga ttg gta cat ttg ctg gta aag atg cgt gtg cgt cct tac 864 

Met Lys Arg Leu Val His Leu Leu Val Lys Met Arg Val Arg Pro Tyr 
2 75 280 285 

tat ata tat gta tgc gat ctt teg ctt gga ata ggt cat ttc cgc acg 912 

Tyr lie l?yr Val Cys Asp Leu Ser Leu Gly lie Gly His Phe Arg Thr 

290 295 3 00 

ccg gta tct aaa gga ate gaa att ate gaa aat ttg cgc gga cac acc 96 0 

Pro Val Ser Lys Gly lie Glu He He Glu Asn Leu Arg Gly His Thr 
305 310 315 ~ 320 



teg ggc tat gca gtt cct acc ttt gtg gta ggt get ccg ggg g-gt ggt 
Ser Gly Tyr Ala Val Pro Thr Phe Val Val Gly Ala Pro Gly Gly Gly 
325 330 335 



taa 



<210> 5 5 

<211> 416 

<212> PIRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 5 5 

Met Ala <31u Ser Arg Arg Lys Tyr Tyr Phe Pro Asp Val Thr Asp Glu 
1 5 10 15 

Gin Trp Tyr Asp Trp His Trp Gin Val Leu Asn Arg lie Lys Thr Leu 
20 25 30 



1008 



ggt aag ata cct gta acg ccg aac tat gtt gta tct cag tec oca cga 1056 
Gly Lys Xle Pro Val Thr Pro Asn Tyr Val Val Ser Gin Ser Pro Arg 

340 345 350 

cat gtg gtt ctt cgc aat tat gaa ggt gtt ate aca acc tat acg gag 1104 

His Val Val Leu Arg Asn Tyr Glu Gly Val He Thr Thr Tyr Thr Glu 
355 360 365 

ccg gag aat tat cat gag gag tgc gat tgt gag gac tgt cga ejee ggt 1152 

Pro Glu Asn Tyr His Glu Glu Cys Asp Cys Glu Asp Cys Arg Ala Gly 
370 375 380 

aag cat aaa gag ggt gta get gca ctt tec gga ggt cag cag ttg get 12 0 0 

Lys His X/ys Glu Gly Val Ala Ala Leu Ser Gly Gly Gin Gin Leu Ala 
385 390 395 400 

ate gag cct tec gac tta get cgc aaa aaa cgc aag ttt gat aag aac 1248 

lie Glu IPro Ser Asp Leu Ala Arg Lys Lys Arg Lys Phe Asp Hys Asn 
405 410 415 



1251 



Asp Gin Leu Lys Lys Tyr Val Thr Leu Thr Ala Glu Glu Glu Glu Gly 
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35 40 45 



Val Lys Glu Ser Pro Lys Val Leu Arg Met Ala lie Thr Pro Tyr Tyr 
50 55 60 



Leu Ser Leu He Asp Pro Glu Asn Pro Asn Cys Pro lie Arg Lys Gin 
65 70 75 80 



Ala lie Pro Trir Gin Gin Glu Leu Val Arg Ala Pro Glu Asp Gin Val 
85 90 95 



Asp Pro Leu Ser Glu Asp Glu Asp Ser Pro Val Pro Gly Leu Thr His 
1O0 105 110 



Arg Tyr Pro Asp Arg Val Leu Phe Leu lie Thr Asp Lys Cys Ser Met 
115 120 125 



Tyr Cys Arg H±s Cys Thr Arg Arg Arg Phe Ala Gly Gin Lys Asp Ala 
130 135 140 



Ser Ser Pro Ser Glu Arg lie Asp Arg Cys lie Asp Tyr lie Ala Asn 
145 150 155 160 



Thr Pro Thr Val Arg Asp Val Leu Leu Ser Gly Gly Asp Ala Leu Leu 
165 170 175 



Val Ser Asp Glu Arg Leu Glu Tyr lie Leu Lys Arg Leu Arg Glu Val 
ISO 185 190 



Pro His Val Glu lie Val Arg lie Gly Ser Arg Thr Pro Val Val Leu 
195 200 205 



Pro Gin Arg He Thr Pro Gin Leu Val Asp Met Leu Lys Lys Tyr His 
210 215 220 



Pro Val Trp Leu Asn Thr His Phe Asn His Pro Asn Glu Val Thr Glu 
225 230 235 240 



Glu Ala Val Glu Ala Cys Glu Arg Met Ala Asn Ala Gly lie Pro Leu 
245 250 255 



Gly Asn Gin Trir Val Leu Leu Arg Gly lie Asn Asp Cys Thr His Val 
260 265 270 
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Met Lys Arg Leu Val His Leu Leu Val Lys Met Arg Val Arg Pro Tyr 
275 280 285 



Tyr lie Tyr Val Cys Asp Leu Ser Leu Gly lie Gly His Phe Arg Thr 
290 295 300 



Pro Val Ser Lys Gly lie Glu He He Glu Asn Leu Arg- Gly His Thr 
305 310 315 320 



Ser Gly Tyr Ala Val Pro Thr Phe Val Val Gly Ala Pro Gly Gly Gly 
325 330 335 



Gly Lys He Pro Val Thr Pro Asn Tyr Val Val Ser Gin Ser Pro Arg 
340 345 350 



His Val Val Leu Arg Asn Tyr Glu Gly Val lie Thr Thzrr Tyr Thr Glu 
355 360 365 



Pro Glu Asn Tyr His Glu Glu Cys Asp Cys Glu Asp Cys Arg Ala Gly 
370 375 380 



Lys His Lys Glu Gly Val Ala Ala Leu Ser Gly Gly Gin. Gin Leu Ala 
385 390 395 400 



lie Glu Pro Ser Asp Leu Ala Arg Lys Lys Arg Lys Phe Asp Lys Asn 
405 410 415 



<210> 56 

<211> 1278 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<220> 

<221> CDS 

<222> (1) . . (1278) 

<400> 56 

atg aat aca gtt aat act cgt aaa aaa ttt ttc cca aat gta act gat 48 

Met Asn Thr Val Asn Thr Arg Lys Lys Phe Phe Pro Asn Val Thr Asp 

15 10 15 

gaa gaa tgg aat gat tgg aca tgg caa gta aaa aac cgc ctt aaa agt 96 

Glu Glu Trp Asn Asp Trp Thr Trp Gin Val Lys Asn Arg Leu Lys Ser 
20 25 30 
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gtt gaa gat tta gaa aaa tat gtt gat tta agt gaa gaa gaa aca gaa 144 
Val Glu Asp Leu Glu Lys Tyr Val Asp Leu Ser Glu Glu Glu Thr Glu 
35 40 45 

ggg gtt gta cgc act ctt gaa act tta cgt atg gca ate act cca ttt 192 
Gly Val Val Arg Thr Leu Glu Thr Leu Arg Met Ala lie Thr Pro Plie 
50 55 60 

tac ttc tea ttg ata gat ttg aat agt gat cgc tgc cca ata cgt aag 240 
Tyr Phe Ser Leu He Asp Leu Asn Ser Asp Arg Cys Pro He Arg Lys 
65 70 75 8 0 

caa get ata cct act ata cga gaa ata cat caa tct cjat get gat atg 2 88 

Gin Ala lie Pro Thr He Arg Glu He His Gin Ser A^sp Ala Asp Met 
85 90 95 

ttg gat cct eta cat gaa gat gaa gac tct cca gta oca gga tta act 336 
Leu Asp Pro Leu His Glu Asp Glu Asp Ser Pro Val Pro Gly Leu Thr 
100 105 110 

cat cgc tat cca gat cgt gtt tta ctt eta ata aca gjac atg tgt tct 3 84 

His Arg Tyr Pro Asp Arg Val Leu Leu Leu lie Thr -A.sp Met Cys Ser 
115 120 125 

gta tac tgt cgc cac tgc act cgt cgc aga ttt get gjgg tea agt gat 432 
Val Tyr Cys Arg His Cys Thr Arg Arg Arg Phe Ala Gly Ser Ser Asp 
130 135 140 

ggt get atg cct atg gat aga att gac aaa gca ata cjaa tat att gca 480 
Gly Ala Met Pro Met Asp Arg He Asp Lys Ala He Glu Tyr He Ala 
145 150 155 160 

aaa act cca caa gta agg gat gta ttg tta tea gga g-ga gat gca ctt 52 8 

Lys Thr Pro Gin Val Arg Asp Val Leu Leu Ser Gly Gly Asp Ala Leu 
165 170 175 

eta gtt tct aat aaa aaa tta gaa age ata ate caa aaa eta cgc gca 57 6 

Leu Val Ser Asn Lys Lys Leu Glu Ser lie lie Gin Lys Leu Arg Ala 
180 185 190 

ata cct cat gtt gaa ata ate aga ata gga agt cgt sea cca gtt gtt 624 
He Pro His Val Glu He He Arg He Gly Ser Arg Thr Pro Val Val 
195 200 205 

tta cct caa aga att act cct gaa tta tgt aat atg -tta aag aaa tat 672 
Leu Pro Gin Arg He Thr Pro Glu Leu Cys Asn Met leu Lys Lys Tyr 
210 215 22 0 

cat cca att tgg atg aat act cat ttt aac cac cct caa gaa gta acg 72 0 

His Pro He Trp Met Asn Thr His Phe Asn His Pro Gin Glu Val Thr 
225 230 235 240 

cca gaa get aaa aaa get tgt gaa atg ttg gca gat gca gga gtt cca 7 68 

Pro Glu Ala Lys Lys Ala Cys Glu Met Leu Ala Asp Ala Gly Val Pro 
245 250 255 

tta gga aat caa act gta eta tta aga gga ata aat gac agt gta cct 816 
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Leu Gly Asn Gin Thr Val Leu Leu Arg Gly lie Asn Asp Ser Val Pro 
260 265 270 

gta atg aaa agg tta gta cat gat tta gta atg atg cgt gta cgc cct 864 
Val Met Lys Arg Leu Val His Asp Leu Val Met Met Arg Val Arg Pro 
275 280 285 

tat tat att tac caa tgt gac tta tct atg gga etc gaa cac ttc cgc 912 
Tyr Tyr lie Tyr Gin Cys Asp Leu Ser Met Gly Leu Glu His Phe Arg 
290 295 300 

aca cca gtt tct aaa ggt ata gaa att att gaa gga tta cgt gga cat 960 
Thr Pro Val Ser Lys Gly lie Glu lie lie Glu Gly Leu Arg Gly His 
305 310 315 320 

aca tct gga tat gca gta cca aca ttt gtt gtg cat gca cct ggt ggt 1008 
Thr Ser Gly Tyr Ala Val Pro Thr Phe Val Val His Ala Pro Gly Gly 
325 330 335 

gga gga aaa act cca gta atg cct caa tat gta att tct caa tct cct 1056 
Gly Gly Lys Thr Pro Val Met Pro Gin Tyr Val lie Ser Gin Ser Pro 
340 345 350 

cat cgt gta gtt tta cgc aac ttt gaa gga gtt ata aca act tat aca 1104 
His Arg Val Val Leu Arg Asn Phe Glu Gly Val lie Thr Thr Tyr Thr 
355 360 365 

gaa cca gaa aat tat aca cat gaa cct tgt tat gat gaa gaa aaa ttt 1152 
Glu Pro Glu Asn Tyr Thr His Glu Pro Cys Tyr Asp Glu Glu Lys Phe 
370 375 380 

gaa aaa atg tat gaa ata agt gga gtt tat atg eta gat gaa gga tta 12 00 

Glu Lys Met Tyr Glu lie Ser Gly Val Tyr Met Leu Asp Glu Gly Leu 
385 390 395 400 

gaa atg tea eta gaa cct age cac tta gca cgt cat gaa cgc aat aaa 1248 
Glu Met Ser Leu Glu Pro Ser His Leu Ala Arg His Glu Arg Asn Lys 
405 410 415 

aag aga gca gaa get gaa ggg aaa aaa taa 1278 
Lys Arg Ala Glu Ala Glu Gly Lys Lys 
420 425 



<210> 57 

<211> 425 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 57 



Met Asn Thr Val Asn Thr Arg Lys Lys Phe Phe Pro Asn Val Thr Asp 
15 10 15 
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Glu Glu Trp Asn Asp Trp Thr Trp Gin Val Lys Asn Arg Leu Lys Ser 
20 25 30 



Val Glu Asp Leu Glu Lys Tyr Val Asp Leu Ser Glu Glu Glu Thr Glu 
35 40 45 



Gly Val Val Arg Thr Leu Glu Thr Leu Arg Met Ala He Thr Pro Phe 
50 55 60 



Tyr Phe Ser Leu He Asp Leu Asn Ser Asp Arg Cys Pro He Arg Lys 
65 70 75 80 



Gin Ala He Pro Thr He Arg Glu Xle His Gin Ser Asp Ala Asp Met 
85 90 95 



Leu Asp Pro Leu His Glu Asp Glu Asp Ser Pro Val Pro Gly Leu Thr 
100 105 110 



His Arg Tyr Pro Asp Arg Val Leu Leu Leu He Thr Asp Met Cys Ser 
115 120 125 



Val Tyr Cys Arg His Cys Thr Arg Arg Arg Phe Ala Gly Ser Ser Asp 
130 135 140 



Gly Ala Met Pro Met Asp Arg He Asp Lys Ala He Glu Tyr lie Ala 
145 150 155 160 



Lys Thr Pro Gin Val Arg Asp Val Leu Leu Ser Gly Gly Asp Ala Leu 
165 170 175 



Leu Val Ser Asn Lys Lys Leu Glu Ser He He Gin Lys Leu Arg Ala 
180 185 190 



He Pro His Val Glu He He Arg Xle Gly Ser Arg Thr Pro Val Val 
195 200 205 



Leu Pro Gin Arg He Thr Pro Glu Leu Cys Asn Met Leu Lys Lys Tyr 
210 215 220 



His Pro He Trp Met Asn Thr His L^he Asn His Pro Gin Glu Val Thr 
225 230 235 240 



Pro Glu Ala Lys Lys Ala Cys Glu Met Leu Ala Asp Ala Gly Val Pro 
245 250 255 
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Leu Gly Asn Gin Thr Val Leu Leu Arg Gly He Asn Asp Ser Val Pro 
260 265 270 

Val Met Lys Arg Leu Val His Asp Leu Val Met Met Arg Val Arg Pro 
275 280 285 



Tyr Tyr He Tyr Gin Cys Asp Leu Ser Met Gly Leu Glu His Phe Arg 
290 295 300 



Thr Pro Val Ser Lys Gly He Glu He He Glu Gly Leu Arg Gly His 

305 310 315 " 320 

Thr Ser Gly Tyr Ala Val Pro Thr Phe Val Val His Ala Pro Gly Gly 

325 330 335 



Gly Gly Lys Thr Pro Val Met Pro Gin Tyr Val He Ser Gin Ser Pro 
340 345 350 



His Arg Val Val Leu Arg Asn Phe Glu Gly Val He Thr Thr Tyr Thr 
355 360 365 



Glu Pro Glu Asn Tyr Thr His Glu Pro Cys Tyr Asp Glu Glu Lys Phe 
370 375 380 



Glu Lys Met Tyr Glu He Ser Gly Val Tyr Met Leu Asp Glu Gly Leu 

385 390 395 400 

Glu Met Ser Leu Glu Pro Ser His Leu Ala Arg His Glu Arg Asn Lys 

405 410 ~ 415 



Lys Arg Ala Glu Ala Glu Gly Lys Lys 
420 425 



<210> 58 

<211> 1416 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Synthetic Construct 



<220> 
<221> 
<222> 



CDS 

(1) . . (1416) 
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<400> 58 

atg aaa aac aaa tgg tat aaa ccg aaa egg cat tgg aag gag ate gag 48 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 

tta tgg aag gac gtt ccg gaa gag aaa tgg aac gat tgg ctt tgg cag 9 6 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 

ctg aca cac act gta aga acg tta gat gat tta aag aaa gtc att aat 144 
Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 

ctg acc gag gat gaa gag gaa ggc gtc cgt att tct acc aaa acg ate 192 
Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr Lys Thr He 
50 55 60 

ccc tta aat att aca cct tac tat get tct tta atg gac ccc gac aat 240 
Pro Leu Asn He Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 

ccg aga tgc ccg gta cgc atg cag tct gtg ccg ctt tct gaa gaa atg 2 88 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 

cac aaa aca aaa tac gat atg gaa gac ccg ctt cat gag gat gaa gat 33 6 

His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 

tea ccg gta ccc ggt ctg aca cac cgc tat ccc gac cgt gtg ctg ttt 3 84 

Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 

ctt gtc acg aat caa tgt tec gtg tac tgc cgc cac tgc aca cgc egg 432 
Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 

cgc ttt tec gga caa ate gga atg ggc gtc ccc aaa aaa cag ctt gat 48 0 

Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 

get gca att get tat ate egg gaa aca ccc gaa ate cgc gat tgt tta 52 8 

Ala Ala lie Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 

att tea ggc ggt gat ggg ctg etc ate aac gac caa att tta gaa tat 57 6 

He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He Leu Glu Tyr 
180 185 190 

att tta aaa gag ctg cgc age att ccg cat ctg gaa gtc ate cgc ate 624 
He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val He Arg He 
195 200 205 

gga aca cgt get ccc gtc gtc ttt ccg cag cgc att acc gat cat ctg 672 
Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 
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tgc gag ata 
Cys Glu He 
225 

aac aca age 
Asn Thr Ser 



ctg gtg aac 
Leu Val Asn 



ggt att aat 
Gly He Asn 
275 

gta aaa ate 
Val Lys He 
290 

gaa gga ata 
Glu Gly He 
305 

att gaa ggg 
lie Glu Gly 



gtc gtt cac 
Val Val His 



tat gtc ctg 
Tyr Val Leu 
355 

ggt gtg att 
Gly Val He 
370 

gca gac gec 
Ala Asp Ala 
385 

gag ccg ate 
Glu Pro He 



aca cct gaa 
Thr Pro Glu 



aat ccg gag 
Asn Pro Glu 
435 

etc aaa gaa 
Leu Lys Glu 



ttg aaa aaa 
Leu Lys Lys 
230 

ate gaa atg 
He Glu Met 
245 

gcg gga gtg 
Ala Gly Val 
260 

gat teg gtt 
Asp Ser Val 



aga gtc cgt 
Arg Val Arg 



ggg cat ttc 

Gly His Phe 
310 

ctg aga ggt 
Leu Arg Gly 
325 

gca cca ggc 
Ala Pro Gly 
340 

tea caa agt 
Ser Gin Ser 



acg tea tat 
Thr Ser Tyr 



tat ttt gag 
Tyr Phe Glu 
390 

ggg ctg agt 
Gly Leu Ser 
405 

aat gta gac 
Asn Val Asp 
420 

cat gaa aca 
His Glu Thr 



aag aaa ttt 
Lys Lys Phe 



tat cat ccg 
Tyr His Pro 



aca gaa gaa 
Thr Glu Glu 



ccg gtc gga 
Pro Val Gly 
265 

cca att atg 
Pro He Met 
280 

cct tat tat 
Pro Tyr Tyr 
295 

cgt get cct 
Arg Ala Pro 



cat ace tea 
His Thr Ser 



gga gga ggt 
Gly Gly Gly 
345 

cct gac aaa 
Pro Asp Lys 
360 

ccg gaa cca 
Pro Glu Pro 
375 

tec gtt ttc 
Ser Val Phe 



gee att ttt 
Ala He Phe 



aga ate aaa 
Arg lie Lys 
425 

tta aaa gat 
Leu Lys Asp 
440 

ttg gcg cag 
Leu Ala Gin 



gtc tgg ctg 
Val Trp Leu 
235 

tec gtt gag 
Ser Val Glu 
250 

aat cag get 
Asn Gin Ala 



aaa aag etc 
Lys Lys Leu 



att tac caa 
He Tyr Gin 
300 

gtt tec aaa 
Val Ser Lys 
315 

ggc tat gcg 
Gly Tyr Ala 
330 

aaa ate gee 
Lys He Ala 



gtg ate tta 
Val He Leu 



gag aat tat 
Glu Asn Tyr 
380 

cct gaa ace 
Pro Glu Thr 
395 

get gac aaa 
Ala Asp Lys 
410 

egg cgt gag 
Arg Arg Glu 



egg cgt gag 
Arg Arg Glu 



cag aaa aaa 
Gin Lys Lys 



aac acc cat 
Asn Thr His 



gca tgt gaa 
Ala Cys Glu 
255 

gtc gta tta 
Val Val Leu 
270 

atg cat gac 
Met His Asp 
285 

tgt gat ctg 
Cys Asp Leu 



ggt ttg gag 
Gly Leu Glu 



gtt cct acc 
Val Pro Thr 
335 

ctg cag ccg 
Leu Gin Pro 
350 

aga aat ttt 
Arg Asn Phe 
365 

ate ccc aat 
He Pro Asn 



get gac aaa 
Ala Asp Lys 



gaa gtt teg 
Glu Val Ser 
415 

gca tac ate 
Ala Tyr He 
430 

aaa aga gat 
Lys Arg Asp 
445 

cag aaa gag 
Gin Lys Glu 



ttt 720 

Phe 

240 

aag 7 68 

Lys 



gca 816 
Ala 



ttg 864 
Leu 



tea 912 
Ser 



ate 960 

He 

320 

ttt 1008 
Phe 



aac 1056 
Asn 



gaa 1104 
Glu 



cag 1152 
Gin 



aag 12 0 0 
400 

ttt 1248 
Phe 



gca 1296 
Ala 



cag 1344 
Gin 



act 1392 
Thr 
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450 455 460 



gaa tgc gga ggg gat tct tea taa 
Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 59 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 59 



Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu lie Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu. Glu Glu Gly Val Arg lie Ser Thr Lys Thr He 
50 55 60 



Pro Leu Asn He Th_x Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 



His Lys Thr Lys Ty-r Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 HO 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



1416 



Ala Ala lie Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
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3_65 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin lie Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser lie Pro His Leu Glu Val lie Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 " 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg "Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Gly His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu He 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys He Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 
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Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Phe 
405 410 415 

Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Asp Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 60 
<211> 471 
<212> PRT 

<213> lysine 2 , 3-aminomutase from Bacillus subtilis 
<400> 60 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg- His Trp Lys Glu He Glu 
15 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp> Asn Asp Trp Leu Trp Gin 
20 25 30 

Leu Thr His Thr Val Arg Thr Leu Asp Asp> Leu Lys Lys Val He Asn 
3 5 40 45 

Leu Thr Glu Asp Glu Glu Glu Gly Val Arg- He Ser Thr Lys Thr lie 
50 55 60 



Pro Leu Asn He Thr Pro Tyr Tyr Ala Serr Leu Met Asp Pro Asp Asn 
65 70 75 80 

Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 90 95 

His Lys Thr Lys Tyr Asp Leu Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 HO 



Ser Arg Val Pro Gly Leu Thr His Arg Tyrr Pro Asp Arg Val Leu Phe 
115 120 125 
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Leu Val Thr Asn Gin Cys Ser Met Tyr Cys Arg Tyr Cys Thr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr lie Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 1V0 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin lie Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro R±s Leu Glu Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu S<er Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met L^/s Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr I He Tyr Gin Cys Asp Leu Ser 
290 295 300 



Glu Gly He Gly His Phe Arg Ala Pro Vail Ser Lys Gly Leu Glu He 
305 "" 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 3 30 335 



Val Val Asp Ala Pro Gly Gly Gly Gly L:ys He Ala Leu Gin Pro Asn 
340 345 350 
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Tyr Val Leu Ser Gin Ser Pro Asp Lys Val lie Leu Arg Asn Phe Glu 
355 360 365 

Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 

Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 ' 390 395 400 

Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Phe 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala. Tyr He Ala 
420 425 430 

Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Arg- Arg Asp Gin 
435 440 445 

Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Glrx Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 61 
<211> 471 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 
<400> 61 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
1^5 10 15 

Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
20 25 30 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val He Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg He Ser Thrcr Lys Thr He 
5 0 55 60 
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Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Pro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser Glu Glu Met 
85 9 0 95 



His Lys Thr Lys Tyr Asp Leu Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 110 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp -Arg Val Leu Phe 
115 120 0.25 



Leu Val Thr Asn Gin Cys Ser Met Tyr Cys Arg Tyr Cys Thr Arg Arg 
130 - 135 140 



Arg Phe Ser Gly Gin lie Gly Met Gly Val Pro Lys Lys Gin Leu Asp 
145 150 155 160 



Ala Ala lie Ala Tyr He Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin Tie Leu Glu Tyr 
180 185 190 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu "Val He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Thr His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala Cys Glu Lys 
245 250 255 



Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Leu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro He Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr He Tyr Gin Cys Asp Leu Ser 
290 295 300 
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Glu Gly lie Gly His Phe Arg Ala Pro Val Ser Lys Gly Leu Glu lie 
305 310 315 320 



lie Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Thr Phe 
325 330 335 



Val Val Asp Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Pro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val He Leu Arg Asn Phe Glu 
355 360 365 



Gly Val He Thr Ser Tyr Pro Glu Pro Glu Asn Tyr lie Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Ser Phe 
405 410 415 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Asp Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys Glu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 62 
<211> 471 
<212> PRT 

<213> Artifical Sequence 
<400> 62 

Met Lys Asn Lys Trp Tyr Lys Pro Lys Arg His Trp Lys Glu He Glu 
15 10 15 



Leu Trp Lys Asp Val Pro Glu Glu Lys Trp Asn Asp Trp Leu Trp Gin 
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20 25 30 



Leu Thr His Thr Val Arg Thr Leu Asp Asp Leu Lys Lys Val lie Asn 
35 40 45 



Leu Thr Glu Asp Glu Glu Glu Gly Val Arg lie Ser Thr L^s Thr lie 
50 55 60 



Pro Leu Asn lie Thr Pro Tyr Tyr Ala Ser Leu Met Asp Puro Asp Asn 
65 70 75 80 



Pro Arg Cys Pro Val Arg Met Gin Ser Val Pro Leu Ser GZLu Glu Met 
85 90 95 



His Lys Thr Lys Tyr Asp Met Glu Asp Pro Leu His Glu Asp Glu Asp 
100 105 1=L0 



Ser Pro Val Pro Gly Leu Thr His Arg Tyr Pro Asp Arg Val Leu Phe 
115 120 125 



Leu Val Thr Asn Gin Cys Ser Val Tyr Cys Arg His Cys Tlnr Arg Arg 
130 135 140 



Arg Phe Ser Gly Gin He Gly Met Gly Val Pro Lys Lys GHn Leu Asp 
145 150 155 160 



Ala Ala He Ala Tyr lie Arg Glu Thr Pro Glu He Arg Asp Cys Leu 
165 170 175 



He Ser Gly Gly Asp Gly Leu Leu He Asn Asp Gin He L«su Glu Tyr 
180 185 ISO 



He Leu Lys Glu Leu Arg Ser He Pro His Leu Glu Val I He Arg He 
195 200 205 



Gly Thr Arg Ala Pro Val Val Phe Pro Gin Arg He Thr Asp His Leu 
210 215 220 



Cys Glu He Leu Lys Lys Tyr His Pro Val Trp Leu Asn Ttir His Phe 
225 230 235 240 



Asn Thr Ser He Glu Met Thr Glu Glu Ser Val Glu Ala C^s Glu Lys 
245 250 255 
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Leu Val Asn Ala Gly Val Pro Val Gly Asn Gin Ala Val Val Lesu Ala 
260 265 270 



Gly He Asn Asp Ser Val Pro lie Met Lys Lys Leu Met His Asp Leu 
275 280 285 



Val Lys He Arg Val Arg Pro Tyr Tyr lie Tyr Gin Cys Asp Lesu Ser 
290 295 300 



Glu Gly lie Arg His Phe Arg Ala Pro Val Ser Lys Gly Leu Gl_u lie 
305 310 315 320 



He Glu Gly Leu Arg Gly His Thr Ser Gly Tyr Ala Val Pro Trar Phe 
325 330 335 



Val Val His Ala Pro Gly Gly Gly Gly Lys lie Ala Leu Gin Prro Asn 
340 345 350 



Tyr Val Leu Ser Gin Ser Pro Asp Lys Val lie Leu Arg Asn Ptae Glu 
355 360 365 



Gly Val lie Thr Ser Tyr Pro Glu Pro Glu Asn Tyr He Pro Asn Gin 
370 375 380 



Ala Asp Ala Tyr Phe Glu Ser Val Phe Pro Glu Thr Ala Asp Lys Lys 
385 390 395 400 



Glu Pro He Gly Leu Ser Ala He Phe Ala Asp Lys Glu Val Sesr Ser 
405 410 4X5 



Thr Pro Glu Asn Val Asp Arg He Lys Arg Arg Glu Ala Tyr He Ala 
420 425 430 



Asn Pro Glu His Glu Thr Leu Lys Asp Arg Arg Glu Lys Arg Gly Gin 
435 440 445 



Leu Lys Glu Lys Lys Phe Leu Ala Gin Gin Lys Lys Gin Lys GXu Thr 
450 455 460 



Glu Cys Gly Gly Asp Ser Ser 
465 470 



<210> 63 
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<211> 49 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Bacillus specific primer 



<220> 

<221> misc_f eature 
<223> Forward primer 

<400> 63 

ccagcctggc cataaggaga tatacatatg aaaaacaaat ggtataaac 49 



<210> 64 

<211> 50 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Bacillus specific primer 



<220> 

<221> misc_feature 
<223> Reverse primer 



<400> 64 

atggtgatgg tgatggtggc cagtttggcc ttatgaagaa tcccctccgc 



SO 



